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Preface to the Second Edition 


It is five years since the first edition was published, and many things have moved on 
sufficiently to justify this second edition. 

Some things never change; I have left the elementary chapters alone and I still believe 
that Appendix A on relevant mathematical methods is the correct place for you to start your 
studies. 

Some topics have matured in the last five years. Density functional theory (and especially 
the B3LYP choice of functionals) has become the workhorse of modem computational 
chemistry. I have reworked all the problems and expanded the text as appropriate. 

I have also said ‘goodbye’ to a few of the older topics. For example, everyone can now 
do chemical drawing, so I do not need to teach it. Thankfully that bane of our lives the 
Z-matrix has all but disappeared; I still have fond memories of straggling to get cyclic 
structures symmetrical and so it still gets a page of discussion. 

I have completely rewritten the chapters dealing with Monte Carlo and molecular dynam¬ 
ics, the Gn models, transition states and solvent models. I have also added a completely 
new chapter called ‘Sharing Out the Energy’, and I hope you will enjoy reading it. 

It is fashionable to have an associated website with any new teaching text, and I have 
therefore added a website at 

http://www.wileyeurope.com/college/hinchhffe 

where you will find a number of problem sets and their solutions. Feel free to use them any 
way you like. I used them in my own teaching. Perhaps you have a corresponding set that 
you would like to share with the rest of us? Let me know. 

I did all the illustrative calculations using either Gaussian 03 or HyperChem; these were 
done either on a beautiful Sony Vaio laptop or on the University of Manchester’s High 
Performance Computing parallel computer, a Bull Itanium2 system. 

As always, I welcome comments and can be reached at: Alan.Hinchliffe@manchester. 
ac.uk. 


Alan Hinchliffe 
Manchester, UK 



Preface to the First Edition 


There is nothing radically new about the techniques we use in modem molecular modelling. 
Classical mechanics hasn’t changed since the time of Newton, Hamilton and Lagrange, 
the great ideas of statistical mechanics and thermodynamics were discovered by Ludwig 
Boltzmann and J. Willard Gibbs amongst others and the basic concepts of quantum mechan¬ 
ics appeared in the 1920s, by which time J.C. Maxwell’s famous electromagnetic equations 
had long since been published. 

The chemically inspired idea that molecules can profitably be treated as a collection 
of balls joined together with springs can be traced back to the work of D.H. Andrews 
in 1930. The first serious molecular Monte Carlo simulation appeared in 1953, closely 
followed by B.J. Alder and T.E. Wainwright’s classic molecular dynamics study of hard 
discs in 1957. 

The Hartrees’ 1927 work on atomic structure is the concrete foundation of our everyday 
concept of atomic orbitals, whilst C.C.J. Roothaan’s 1951 formulation of the HF-LCAO 
model arguably gave us the basis for much of modem molecular quantum theory. 

If we move on a little, most of my colleagues would agree that the two recent major 
advances in molecular quantum theory have been density functional theory, and the elegant 
treatment of solvents using ONIOM. Ancient civilizations believed in the cyclic nature of 
time and they might have had a point for, as usual, nothing is new. Workers in solid-state 
physics and biology actually proposed these models many years ago. It took the chemists 
a while to catch up. 

Scientists and engineers first got their hands on computers in the late 1960s. We have 
passed the point on the computer history curve where every ten years gave us an order 
of magnitude increase in computer power, but it is no coincidence the that growth in our 
understanding and application of molecular modelling has mn in parallel with growth in 
computer power. Perhaps the two greatest driving forces in recent years have been the PC 
and the graphical user interface. I am humbled by the fact that my lowly 1.2 GHz AMD 
Athlon office PC is far more powerful than the world-beating mainframes that I used as a 
graduate student all those years ago, and that I can build a molecule on screen and run a 
B3LYP/6-311-I—|-G(3d,2p) calculation before my eyes (of which more in Chapter 20). 

We have also reached a stage where tremendously powerful molecular modelling com¬ 
puter packages are commercially available, and the subject is routinely taught as part of 
undergraduate science degrees. I have made use of several such packages to produce the 
screenshots; obviously they look better in colour than the greyscale of this text. 
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There are a number of classic (and hard) texts in the field; if I’m stuck with a basic 
molecular quantum mechanics problem, I usually reach for Eyring, Walter and Kimball’s 
Quantum Chemistry but the going is rarely easy. 

Equally there are a number of beautifully produced elementary texts and software refer¬ 
ence manuals that can apparently transform you into an expert overnight. It’s a two-edged 
sword, and we are victims of our own success. One often meets self-appointed experts in 
the field who have picked up much of the jargon with little of the deep understanding. It’s 
no use (in my humble opinion) trying to hold a conversation about gradients, hessians and 
density functional theory with a colleague who has just run a molecule through one package 
or another but hasn’t the slightest clue what the phrases or the output mean. 

It therefore seemed to me (and to the reviewers who read my new book proposal) that 
the time was right for a middle course. I assume that you are a ‘Beginner’ in the sense of 
Chambers dictionary, someone who begins; a person who is in the early stages of learning 
or doing anything. .., and I want to tell you how we go about modern molecular modelling, 
why we do it, and most important of all, explain much of the basic theory behind the mouse 
clicks. This involves mathematics and physics, and the book neither pulls punches nor aims 
at instant enlightenment. Many of the concepts and ideas are difficult ones, and you will 
have to think long and hard about them; if it’s any consolation, so did the pioneers in our 
subject. I have given many of the derivations in full, and tried to avoid the dreaded phrase 
‘it can be shown that’. 

There are various strands to our studies, all of which eventually intertwine. We start off 
with molecular mechanics, a classical treatment widely used to predict molecular geomet¬ 
ries. In Chapter 8, I give a quick guide to statistical thermodynamics (if such a thing is 
possible), because we need to make use of the concepts when trying to model arrays of 
particles at nonzero temperatures. Armed with this knowledge, we are ready for an assault 
on Monte Carlo and molecular dynamics. 

Just as we have to bite the bullet of statistical mechanics, so we have to bite the equally 
difficult one of quantum mechanics, which occupies Chapters 11 and 12. We then turn to 
the quantum treatment of atoms, where many of the sums can be done on a postcard if 
armed with knowledge of angular momentum. 

The Hartree-Fock and HF-LCAO models dominate much of the next few chapters, as 
they should. The Hartree-Fock model is great for predicting many molecular properties, 
but it can’t usually cope with bond breaking and bond making. Chapter 19 treats electron 
correlation and Chapter 20 deals with the very topical density functional theory (DFT). 
You won’t be taken seriously if you have not done a DFT calculation on your molecule. 
Quantum mechanics, statistical mechanics and electromagnetism all have a certain well- 
deserved reputation amongst science students; they are hard subjects. Unfortunately all 
three all feature in this new text. In electromagnetism it is mostly a matter of getting to 
grips with the mathematical notation (although I have spared you Maxwell’s beautiful equa¬ 
tions), whilst in the other two subjects it is more a question of mastering hard concepts. In 
the case of quantum mechanics, the concepts are often in direct contradiction to everyday 
experience and common sense. I expect from you a certain level of mathematical compet¬ 
ence; I have made extensive use of vectors and matrices not because I am perverse, but 
because such mathematical notation brings out the inherent simplicity and beauty of many 
of the equations. I have tried to help by giving a mathematical appendix, which should also 
make the text self-contained. 
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I have tried to put the text into historical perspective, and in particular I have quoted 
directly from a number of what I call keynote papers. It is interesting to read at first hand 
how the pioneers put their ideas across, and in any case they do it far better than me. For 
example, I am not the only author to quote Paul Dirac’s famous statement 

The underlying Physical Laws necessary for the mathematical theory of a large part of physics 
and the whole of chemistry are thus completely known, and the difficulty is only that exact 
application of these laws leads to equations much too complicated to be soluble. 

I hope you have a profitable time in your studies, and at the very least begin to appreciate 
what all those options mean next time you run a modelling package! 


Alan Hinchliffe 
Manchester, UK 
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Electric Charges and their 
Properties 


As far as we can tell, there are four fundamental types of interactions between physical 
objects. There is the weak nuclear interaction that governs the decay of beta particles, and 
the strong nuclear interaction that is responsible for binding together the particles in a 
nucleus. The familiar gravitational interaction holds the Earth very firmly in its orbit round 
the Sun, and finally we know that there is an electromagnetic interaction that is responsible 
for binding atomic electrons to nuclei and for holding atoms together when they combine 
to form molecules. 

Of the four, the gravitational interaction is the only one we would normally come across 
in our everyday world. This is because gravitational interactions between bodies always 
add. The gravitational interaction between two atoms is negligible but when large numbers 
of fundamental particles such as atoms are aggregated together, the gravitational interaction 
becomes significant. 

You may think it bizarre that there are four types of interaction, yet, conversely, you 
might wonder why there should be just four. Why not one, three or five? Should there not 
be a unifying theory to explain why there are four, and whether they are related? As I write, 
there is no such unifying theory despite tremendous research activity. 


1.1 Point Charges 

In this chapter I am going to concentrate on electric charges and their properties, since 
electrons and protons are fundamental building blocks for atoms and molecules. 

It turns out that there are two types of electric charge in nature, which we might choose 
to call type X and type Y (or Red and Blue for that matter, but X and Y will do for 
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2 Molecular Modelling for Beginners 

now). Experimental evidence shows the existence of an electrostatic force between electric 
charges; the force between two X-type charges is always repulsive, as is the force between 
two Y-type charges. The force between an X-type and a Y-type is always attractive. For 
this reason, the early experimenters decided to classify charges as positive or negative, 
because a positive quantity times a positive quantity gives a positive quantity, a negative 
quantity times a negative quantity gives a positive quantity whilst a negative quantity times 
a positive quantity gives a negative quantity. I am sure you know that the best known 
fundamental particles responsible for these charges are electrons and protons, and you are 
probably expecting me to tell you that the electrons are the negatively charged particles 
whilst protons are positively charged. It is actually just a convention that we take: we could 
just as well have called electrons positive. 

Whilst on the subject, it is fascinating to note that the charge on the electron is exactly 
equal and opposite of that on a proton. Atoms and molecules generally contain exactly 
the same number of electrons and protons, and so the net charge on a molecule is almost 
always zero. Ions certainly exist in solutions of electrolytes, but the number of Na + ions in 
a solution of sodium chloride is exactly equal to the number of Cl - ions and once again we 
are rarely aware of any imbalance of charge. 

A thunderstorm results when nature separates out positive and negative charges on a 
macroscopic scale. It is thought that friction between moving masses of air and water 
vapour detaches electrons from some molecules and attaches them to others. This results 
in parts of clouds being left with an excess of charge, often with spectacular results. It was 
investigations into such atmospheric phenomena that gave the first clues about the nature 
of the electrostatic force. 

We normally start any study of charges at rest ( electrostatics ) by considering the force 
between two point charges, as shown in Figure 1.1. The term ‘point charge ’ is a mathematical 
abstraction; obviously electrons and protons have a finite size. Just bear with me for a few 
pages, and accept that a point charge is one whose dimensions are small compared to the 
distance between them. An electron is large if you happen to be a nearby electron, but can 
normally be treated as a point charge if you happen to be a human being a metre away. 



The concept of a point charge may strike you as an odd one, but once we have established 
the magnitude of the force between two such charges, we can deduce the force between 
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any arbitrary charge distributions on the grounds that they are composed of a large number 
of point charges. 

In Figure 1.1 we have point charge Q A at position vector R A and <2 b at R B . From the 
laws of vector analysis, the vector Rab = R b — R a joins Q A to Q R , and points from Q A to 
<2 b as shown. I have indicated the direction of the vectors with arrows. 


1.2 Coulomb’s Law 


In 1785, Charles Augustin de Coulomb became the first person to give a mathematical form 
to the force between point charges. He measured the force directly between two very small 
charged bodies, and was able to show that the force exerted by Q A on Q B was 

• proportional to the inverse square of the distance between Q A and Qn when both charges 
were fixed; 

• proportional to Q A when Q R and R AB were fixed; 

• proportional to Q B when Q A and Rab were fixed. 


He also noticed that the force acted along the line joining the centres of the two charges, 
and that the force was either attractive or repulsive depending on whether the charges were 
different or of the same type. The sign of the product of the charges therefore determines 
the direction of the force. 

A mathematical result of these observations can be written in scalar form as 


F A 


oc 


Ga2b 

R 2 ab 


( 1 . 1 ) 


Forces are vector quantities, and Equation (1.1) is better written in vector form a: 

Q a Qb 


When Coulomb first estabhshed his law, he had no means of quantifying charge and so 
could not identify the proportionality constant. He took it to be unity, and thereby defined 
charge in terms of the force between charges. Modem practice is to regard charge and 
force as independent quantities, and because of this a dimensioned proportionality constant 
is necessary. For a reason that need not concern us, this is taken as l/47re 0 , where the 
permittivity of free space e 0 is an experimentally determined quantity with the approximate 
value e 0 = 8.854 x 10 -12 C 2 N -1 m -2 . Coulomb’s law is therefore 


Fa on 


1 8 a g B 

47T£ 0 K\ r 


Rab 


( 1 . 2 ) 


and it applies to measurements done in free space. If we repeat Coulomb’s experiments 
with the charges immersed in different media, then we find that the law still holds but with a 
different proportionality constant. We modify the proportionality constant using a quantity 
e r called the relative permittivity. In older texts, s r is called the dielectric constant. Our 
final statement of Coulomb’s law is therefore 


1 QaQb , 
47T£ r £o 7 ?ab 


(1.3) 
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According to Newton’s third law, we know that if Q A exerts a force F A on B on Q K , then Q B 
should exert an equal and opposite force on Q A . Coulomb’s law satisfies this requirement, 
since 


47T£ r £ 0 /?ba 


(the vector R BA points in the opposite direction to R AB and so one force is exactly the 
negative of the other, as it should be). 


1.3 Pair Wise Additivity 

Suppose we now add a third point charge Q c with position vector R c as shown in Figure 1.2. 
Since Q A and Q B are point charges, the addition of Q c cannot alter the force between Q A 
and Q b . 



Figure 1.2 Third charge added 


The total force on Q B now comprises two terms, the force due to point charge Q A and 
the force due to point charge Q c . This total force is given by 


F b 


Qb 

4ne 0 


(a. 


Rab 




(1.4) 


This may seem at first sight to be a trivial statement: surely all forces act this way. Not 
necessarily, for I have assumed that the addition of Q c did not have any effect on Q A and 
<2b (and so did not influence the force between them). 

The generic term pair wise additive describes things like forces that add as above. Forces 
between point electric charges are certainly pair wise additive, and so you might imagine 
that forces between atoms and molecules must therefore be pair wise additive, because 
atoms and molecules consist of (essentially) point charges. I am afraid that nature is not 
so kind, and we will shortly meet situations where forces between the composites of elec¬ 
trons and protons that go to make up atoms and molecules are far from being pair wise 
additive. 
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1.4 Electric Field 

Suppose now we have a point charge Q at the coordinate origin, and we place another point 
charge q at point P that has position vector r (Figure 1.3). 



Origin 


Figure 1.3 Field concept 


The force exerted by Q on q is 


I --r 


which I can rewrite trivially as 


r =(ii 


The point is that the term in brackets is to do with Q and the vector r, and contains no 
mention of q. If we want to find the force on any arbitrary q at r, we calculate the quantity 
in brackets once and then multiply by q. One way of thinking about this is to imagine that 
the charge Q creates a certain field at point r, which determines the force on any other q 
when placed at position r. 

This property is called the electric field E at that point. It is a vector quantity, like force, 
and the relationship is 

F (on q at r) = qE (at r) 


Comparison with Coulomb’s law, Equation (1.3), shows that the electric field at point r due 
to a point charge Q at the coordinate origin is 


E 


1 Qr 

4tve 0 r 3 


(1.5) 


E is sometimes written E(r) to emphasize that the electric field depends on the position 
vector r. 

Electric fields are vector fields and they are often visualized as field lines. These are 
drawn such that their spacing is inversely proportional to the strength of the field, and their 
tangent is in the direction of the field. They start at positive charges and end at negative 
charges, and two simple examples are shown in Figure 1.4. Here the choice of eight lines 
is quite arbitrary. 

Electric fields that do not vary with time are called electrostatic fields. 
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+Q -Q 

Figure 1.4 Field lines for point charges 


1.5 Work 


Look again at Figure 1.3, and suppose we move point charge q whilst keeping Q fixed 
in position. When a force acts to make something move, energy is transferred. There is a 
useful phrase in physical science that is to do with the energy transferred, and it is work. 
Work measures the energy transferred in any change, and can be calculated from the change 
in energy of a body when it moves through a distance under the influence of a force. 

We have to be careful to take account of the energy balance. If a body gains energy, this 
energy has to come from somewhere, and that somewhere must lose energy. What we do 
is to divide the universe into two parts: the bits we are interested in called the system and 
the rest of the universe that we call the surroundings. 

Some texts focus on the work done by the system, some concern themselves with the 
work done on the system. According to the law of conservation of energy, one is exactly 
the equal and opposite of the other, but we have to be clear which is being discussed. I am 
going to write w on for the work done on our system. 

If the system gains energy, then w m will be positive. If the system loses energy then w on 
will negative. 

We also have to be careful about the phrase ‘through a distance’. The phrase means 
‘through a distance that is the projection of the force vector on the displacement vector’, 
and you should instantly recognize a vector scalar product (see Appendix A). 

A useful formula that relates to the energy gained by a system (i.e. w on ) when a constant 
force F moves its point of application through 1 is 

w 0D - —F.l (1.6) 


In the case where the force is not constant, we have to divide up the motion into differential 
elements dl. The energy transferred is then given by the sum of all the corresponding 
differential elements dw on . The corresponding formulae are 


dw on = —F.dl 

w m = ~f F-dl 


(1.7) 


We now move q by an infinitesimal vector displacement dl, as shown in Figure 1.5, s< 
that it ends up at point r + dl. The work done on the system in that differential change is 


dw on = -F.dl 


If the angle between the vectors r : and dl is 9, then we have 


dw 0 „ = — Fdlcosd 
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and examination of Figure 1.6 shows that dl cos 9 is the radial distance moved by charge 
q, which we will write dr. 



Figure 1.5 Electrostatic work 



Origin 


Figure 1.6 Relationship between vectors 


Hence 


dw on =-— ^dr 

47te 0 r 2 


The total work done moving from position I to position II is therefore found by integrating 


47re 0 J r 2 

4jr£ 0 \r n n) 


The work done depends only on the initial and final positions of charge q\ it is independent 
of the way we make the change. 

Another way to think about the problem is as follows. The force is radial, and we can 
divide the movement from position I to position II into infinitesimal steps, some of which 
are parallel to F and some of which are perpendicular to F. The perpendicular steps count 
0 towards w on , the parallel steps only depend on the change in the (scalar) radial distance. 
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1.6 Charge Distributions 

So far I have concentrated on point charges, and carefully skirted round the question as to 
how we deal with continuous distributions of charge. Figure 1.7 shows a charge distribution 
<2 a - The density of charge need not be constant through space, and we normally write p(r) 
for the density at the point whose position vector is r. The charge contained within the 
volume element dr at r is therefore p(r)dr and the relationship between p(r) and Q A is 
discussed in Appendix A. It is 



(1.9) 


In order to find the force between the charge distribution and the point charge Q B we simply 
extend our ideas about the force between two point charges; one of the point charges being 
p(r)dr and the other Q B . 



Origin 


Ob 


Figure 1.7 Charge distribution 


The total force is given by the sum of all possible contributions from the elements of the 
continuous charge distribution Q A with point charge Q B . The practical calculation of such 
a force can be a nightmare, even for simple charge distributions. One of the reasons for the 
nightmare is that forces are vector quantities; we need to know about both their magnitude 
and their direction. 

In the next section, I am going to tell you about a very useful scalar field called the 
mutual potential energy U. This field has the great advantage that it is a scalar field, and so 
we do not need to worry about direction in our calculations. 


1.7 Mutual Potential Energy, U 

Suppose now we start with charge q at infinity, and move it up to a point with vector position 
r, as shown in Figure 1.3. The work done is 
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and this represents the energy change on building up the charge distribution, with the 
charges initially at infinite separation. It turns out that this energy change is an important 
property, and we give it a special name (the mutual potential energy) and a special symbol 
U (occasionally <£). 

Comparison of the equations for force, work and mutual potential energy given above 
suggests that there might be a link between the force and the mutual potential energy; at 
first sight, one expression looks like the derivative of the other. 

I am going to derive a relationship between force and mutual potential energy. The 
relationship is perfectly general; it applies to all forces provided that they are constant 
in time. 


1.8 Relationship between Force and Mutual Potential Energy 


Consider a body of mass m that moves in (say) the x-direction under the influence of a 
constant force. Suppose that at some instant its speed is v. The kinetic energy is \mv 2 . You 
are probably aware of the law of conservation of energy, and know that when I add the 
potential energy U to the kinetic energy, I will get a constant energy that I will denote e 
(sorry about the clash of symbols with electric permittivity, but that is life): 

e = ^mv 2 + U (1.11) 

I want to show you how to relate U to the force F. If the energy e is constant in time, 
then de/d t = 0. Differentiation of Equation (1.11) with respect to time gives 
de dv d U 

d t mV dt df 


and so, by the chain rule 

de dv dU dx 
dt WV dt dx dt 

If the energy e is constant then its first differential with respect to time is zero, and v is just 
dx/dt. Likewise dv/df is the acceleration and so 


0 = 


dU\ dx 
dx/dt 


( 1 . 12 ) 


Equation (1.12) is true if the speed is zero, or if the term in brackets is zero. According to 
Newton’s second law of motion, mass times acceleration is force, and so 


F 


dU 

"fa 


(1.13) 


which gives us the link between force and mutual potential energy. 

When working in three dimensions, we have to be careful to distinguish between vectors 
and scalars. We treat a body of mass m whose position vector is r. The velocity is v — dr/df 
and the kinetic energy is \m (dr/df)(dr/df). 

Analysis along the lines given above shows that the force F and U are related by 


F = —grad U 


(1.14) 
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where the gradient of U is discussed in Appendix A and is given in Cartesian coordinates by 


dU 

Ik** 


(1.15) 


Here e x , e v and e„ are unit vectors pointing along the Cartesian axes. 


1.9 Electric Multipoles 

We can define exactly an array of point charges by listing the magnitudes of the charges, 
together with their position vectors. If we then wish to calculate (say) the force between one 
array of charges and another, we simply apply Coulomb’s law (Equation (1.3)) repeatedly 
to each pair of charges. Equation (1.3) is exact, and can be easily extended to cover the 
case of continuous charge distributions. 

For many purposes, it proves more profitable to describe a charge distribution in terms 
of certain quantities called the electric moments. We can then discuss the interaction of one 
charge distribution with another in terms of the interactions between the electric moments. 

Consider first a pair of equal and opposite point charges, +Q and —Q separated by 
distance R (Figure 1.8). This pair of charges is usually said to form an electric dipole of 
magnitude QR . In fact, electric dipoles are vector quantities and a more rigorous definition is 

Pe = QR (1.16) 

where the vector R points from the negative charge to the positive charge. 



We sometimes have to concern ourselves with a more general definition, one relating to 
an arbitrary array of charges such as that shown in Figure 1.9. We have four point charges: 
<2i whose position vector is R 1; Q 2 whose position vector is R 2 , Qi whose position vector 
is R 3 and Q A whose position vector is R 4 . We define the electric dipole moment p e of these 
four charges as 

Pe = X>R, 

It is a vector quantity with x, y and z Cartesian components 

Y QiXi, Y Q‘ v ‘ and & Z ‘ 
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Figure 1.9 Generalized electric dipole 


This is consistent with the elementary definition given above; in the case of two equal and 
opposite charges +Q and — Q a distance d apart, the electric dipole moment has magnitude 
Qd and points from the negative charge to the positive. The generalization to n charges is 
obvious: we substitute n for 4 in the above definition. 

There are several general rales about electric moments of charge distributions, and we 
can learn a couple of the ones that apply to dipole moments by considering the simple 
arrays shown in Figure 1.10 and keeping the definitions in mind. 


+0 -Q 


(2) 


. +Q 

(1) 

+0 +0 
(3) 


+Q -2 Q +Q 


(4) 


(5) 


(6) 


Figure 1.10 Simple arrays of point charges 
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I have joined up the charges with lines in order to focus attention on the charge systems 
involved; there is no implication of a ‘bond’. We do not normally discuss the electric 
dipole due to a point charge (1). Examination of the charge distributions (2) through (6) 
and calculation of their electric dipole moment for different coordinate origins suggest the 
general result; neutral arrays of point charges have a unique electric dipole moment that 
does not depend on where we take the coordinate origin. Otherwise, we have to state the 
coordinate origin when we discuss the electric dipole moment. 

I can prove this from Equation (1.16), generalized to n point charges: 

Pe = X>R, 

Suppose that we move the coordinate origin so that each point charge <2, has 
vector R' where 

R, = R' + A 

with A a constant vector. From the definition of electric dipole moment we have 

P e = X>R ( 

and so, with respect to the new coordinate origin 

P>X>R', 

= J2 Qi (R« - A) 

= Pe - A J]<2, 

The two definitions only give the same vector if the sum of charges is zero. We often 
use the phrase gauge invariant to describe quantities that do not depend on the choice of 
coordinate origin. 

Arrays (5) and (6) each have a centre of symmetry. There is a general result that any 
charge distribution having no overall charge but a centre of symmetry must have a zero 
dipole moment, and similar results follow for other highly symmetrical arrays of charges. 

1.9.1 Continuous Charge Distributions 

In order to extend the definition of an electric dipole to a continuous charge distribution 
such as that shown in Figure 1.7, we first of all divide the region of space into differential 
elements dr. If p(r) is the charge density then the change in volume element dr is p(r)dr. 
We then treat each of these volume elements as point charges and add (i.e. integrate). The 
electric dipole moment becomes 


(1.17) 

position 


Pe = / rp (r) dr 


(1.18) 
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1.9.2 Electric Second Moment 

The electric dipole moment of an array of point charges is defined by the three sums 


EGA, J2Q iYi and E GA 

and we can collect them into a column vector in an obvious way ai 

' EGA ' 

Pc = EGA 

t 5 flZ ‘ , 

The six independent quantities 

Ega?, E gaa, E gaa, • • ■. E^ z f 

are said to define the electric second moment of the charge distribution. We usually collect 


< 3 matrix q 



EGA 2 

E GAT 

E GAA 

E GAA 

EGA 2 

E GAA 

E GAA 

E GAA 

EGA 2 


The matrix is symmetric because of the obvious equalities of the off-diagonal sums such as 
E GAT and E GAA 


There are unfortunately many different definitions related to the second (and higher) 
moments in the literature. There is little uniformity of usage, and it is necessary to be 
crystal clear about the definition and choice of origin when dealing with these quantities. 

Most authors prefer to work with a quantity called the electric quadrupole moment rather 
than the second moment, but even then there are several different conventions. A common 
choice is to use the symbol 0 e and the definition 


1 t G, (3A - R]) 

3EGAA 

V 


3 E GAA 


3 E GAA 
EG,-(3*?-*?) 
3 E GAT 


3 E GAA, N 
3 E GAA 
±Q t { 32?-*?) ^ 


( 1 . 21 ) 


Notice that the diagonal elements of this matrix sum to zero and so the matrix has zero trace 
(the trace being the sum of the diagonal elements; see Appendix A). Some authors do not 
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use the factor of '/ 2 in their definition. Quadrupole moments are gauge invariant provided 
the electric dipole moment and the charge are both zero. 

Figure 1.11 shows an octahedrally symmetrical array of point charges. Each point charge 
has magnitude Q, apart from the central charge that has magnitude —6 Q in order to make 
the system neutral. The distance between each axial point charge and the central one is a. 


Q 



Q 

Figure 1.11 Octahedral charge distribution 


If I choose to direct the Cartesian axes along the symmetry axes, then the second moment 
matrix is 

( 2 0 0 \ 

0 2 0 

0 0 2 / 

whilst the quadrupole moment matrix is zero. 

If I now reduce the symmetry of the charge distribution by placing charges 2 Q along the 
vertical axis (taken for the sake of argument as the x-axis) and —HQ at the centre (to keep 
the electrical balance), the second moment matrix becomes 

/ 4 0 0 \ 

Q e = Qa 2 ( 0 2 0 

\ 0 0 2 / 

whilst the quadrupole moment matrix is now 

( 2 0 0 \ 

0-1 0 
0 0 -1 ) 

The electric quadrupole moment measures deviations from spherical symmetry. It is zero 
when the charge distribution has spherical symmetry. 

It always has zero trace (because of the definition), but it is not always diagonal. 
Nevertheless, it can always be made diagonal by a rotation of the coordinate axes. 
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Finally, consider a linear array formed by the top (+0, central (+20 and lower charges 
(-30. We find 

/ —2 0 0 \ / —2 0 0 \ 

q e = Qa 2 l 0 0 0 1 and 0 e = Qa 2 I 0 10 

\ 0 0 0 / \ 0 0 1 / 

In cases where the symmetry of the problem determines that the second moment tensor 
only has one nonzero component, we speak colloquially of the second moment (which in 
this case is —2 Qa 2 ). 

1.9.3 Higher Electric Moments 

The set of ten independent quantities 

X>xf, i ym,..., x>zf 

defines the electric third moment of the charge distribution, and so on. We rarely encounter 
such higher moments of electric charge in chemistry. 


1.10 Electrostatic Potential 

Electrostatic forces are vector quantities, and we have to worry about their magnitude and 
direction. I explained earlier that it is more usual to work with the mutual potential energy 
U rather than the force F, if only because U is a scalar quantity. In any case we can recover 
one from the other by the formula 

F = —grad U 

Similar considerations apply when dealing with electrostatic fields. They are vector fields 
with all the inherent problems of having to deal with both a magnitude and a direction. It 
is usual to work with a scalar field called the electrostatic potential <p. This is related to the 
electrostatic field E in the same way that U is related to F: 

E = —grad <t> 

We will hear more about the electrostatic potential in later sections. In the meantime, I will 
tell you that the electrostatic potential at field point r due to a point charge Q at the coordinate 
origin is 

4>( r) = -r~- ( 1 . 22 ) 

4ne 0 r 

The electric field and the electrostatic potential due to an electric dipole, quadrupole and 
higher electric moments are discussed in all the elementary electromagnetism texts. The 
expressions can be written exactly in terms of the various distances and charges involved. 
For many applications, including our own, it is worthwhile examining the mathematical 
form of these fields for points in space that are far away from the charge distribution. We 
then refer to (for example) the ‘small’ electric dipole and so on. 
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The electrostatic potential at field point r due to a small electric dipole p e at the coordinate 
origin turns out to be 


47T£ 0 r 3 


(1.23) 


which falls off as Mr 2 . It falls off faster with r than the potential due to a point charge 
because of the cancellation due to plus and minus charges. This is in fact a general rule: the 
electrostatic potential for a small electric multipole of order l falls off as r~ ( ' +l) so dipole 
moment potentials fall off faster than those due to point charges, and so on. 


1.11 Polarization and Polarizability 

In electrical circuits, charges are stored in capacitors, which at their simplest consist of a 
pair of conductors carrying equal and opposite charges. Michael Faraday (1837) made a 
great discovery when he observed that filling the space between the plates of a parallel 
plate capacitor with substances such as mica increased their ability to store charge. The 
multiplicative factor is called the relative permittivity and is given a symbol e r , as discussed 
above. I also told you that the older name is the dielectric constant. 

Materials such as glass and mica differ from substances such as copper wire, in that they 
have few conduction electrons and so make poor conductors of electric current. We call 
materials such as glass and mica dielectrics, to distinguish them from metallic conductors. 

Figure 1.12 shows a two-dimensional picture of a dielectric material, illustrated as 
positively charged nuclei each surrounded by a localized electron cloud. 

© © © ® © 

© © © © © 

© © © © © 

© © © © © 

Figure 1.12 Dielectric slab 

We now apply an electrostatic field, directed from the left to the right. There is a force on 
each charge, and the positive charges are displaced to the right whilst the negative charges 
move a corresponding distance to the left, as shown in Figure 1.13. 

The macroscopic theory of this phenomenon is referred to as dielectric polarization, and 
we focus on the induced dipole moment dp e per differential volume dr. Because it is a 
macroscopic theory, no attention is paid to atomic details; we assume that there are a large 
number of atoms or molecules within the volume element dr (or that the effects caused by 
the discrete particles have somehow been averaged out). 


Electric Charges and their Properties 17 


0 0 0 0 ® 

0 0 0 0 ® 

0 0 0 0 ® 

0 0 0 0 ® 


Electric field 

Figure 1.13 Dielectric with applied field 

We relate the induced electric dipole to the volume of a differential element by 

dp e = Pdr (1.24) 

where the dielectric polarization P is an experimentally determined quantity. P can depend 
on the applied field in all manner of complicated ways, but for very simple media and for 
low field strengths, it turns out that P is directly proportional to E. We write 

P=(e r -l)e 0 E (1.25) 

where e r is the relative permittivity of the dielectric. The polarization acts so as to reduce 
the field inside a dielectric. 


1.12 Dipole Polarizability 

At the microscopic level, we concern ourselves with the various electric moments that are 
induced in each atom or molecule. 

Consider the simple case shown in Figure 1.14, where we apply a weak electric field E in 
a direction parallel to a molecule’s electric dipole moment. This causes charge redistribution 
and we can write 

p e (induced) = p e (permanent) + aE (1.26) 

I have distinguished between the permanent electric dipole, the one a molecule has in free 
space with no fields present, from the induced dipole. I have also used the symbol ol for the 
dipole polarizability. 

In the general case, the field need not be weak and the induced dipole need not be in the 
same direction as either the applied field or the permanent dipole moment. This is shown 
in Figure 1.15. Parameter a cannot be a scalar, since the directions of the applied field and 
the induced dipole need not be the same. 
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Electric field 

Figure 1.14 Induced molecular dipole 



Electric field 


Figure 1.15 More general case 

The dipole polarizability a is a special type of physical property called a tensor, just like 
the electric second moment. We can represent a as a 3 x 3 real symmetric matrix 

( a xx a xy a xz \ 

Uyx dyy Uyz I (1-27) 

a a a zy a zz J 

and I will write the more general expression as 

p e (induced) = p e (permanent) + a.E + higher order terms (1.28) 

The higher order terms are the hyperpolarizabilities', they feature in advanced texts of 
this kind. We are not going to meet them again. We interpret Equation (1.28) as a matrix 
equation; the p’s are column vectors. 


1.12.1 Properties of Polarizabilities 

The matrix a can always be written in diagonal form by a suitable rotation of the Cartesian 
axes (see Appendix A) to give 
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The quantities a m , a bb and a cc are called the principal values of the polarizability tensor. For 
molecules with symmetry, the principal axes of polarizability correspond to the molecular 
symmetry axes. For a linear molecule the components that refer to perpendicular axes 
are equal and usually different from the parallel component and the matrix is usually 
written 

(«- 0 0 \ 

a = 0 a bb 0 

\ 0 0 a bb ) 


1.13 Many-Body Forces 


It is instructive to calculate the work done in building up an array of charges such as that 
shown in Figure 1.9. We will assume that all the charges are point charges and so cannot be 
polarized. We start with all the charges (n in total) at infinity, and move them in turn from 
infinity to their position as shown. 

Moving Q x from infinity to position vector R, takes no energy, because no other charges 
are present. 

Moving Q 2 from infinity to position vector R 2 involves an energy change: 


i Qii 

47re 0 R x 


where 7?i 2 is the length of the vector joining the charges. We now move Q 3 from infinity to 
position vector R 3 . This charge moves in the field due to <2, and Q 2 , involving an energy cost: 


_ l QiQ 3 l Q2Q3 

4ire 0 R l3 4 -jt s 0 R 23 


The total work done U tot is seen to be 


<'~=EX> 


(1.29) 


This expression holds because of the pair wise additivity of the forces between point 
charges. The expression would not necessarily hold if the charges were themselves charge 
distributions because the addition of further charges could polarize the existing ones and 
so alter the forces already calculated. 

Whatever the case, C/ tot will depend on the coordinates of all the charges present: 

U tot = U mt (R,,R 2 ,...,R„) (1.30) 

and it is always possible to write a formal expansion 

U tot (R, ,R 2 . ..., R„) = J2 U<2) ( R U R ,) + J2 Ui3) ( R ” R R < ) + ■ ■ ■ 

pairs triples 

+ U w (R l5 R 2 ,... ,R„) 


(1.31) 
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involving distinct pairs, triples, etc. of particles. The U {2) terms are referred to as the pah- 
contributions, the L/ <3) terms are the three-body terms, and so on. Terms higher than the 
second are identically zero for the interaction between point charges. 


1.14 Problem Set 

A set of problems, together with solutions, is available on the associated website. 


2 

The Forces between Molecules 


When molecules are near enough to influence one another, we need to concern ourselves 
with the balance between the forces of attraction and repulsion. We know that such attractive 
forces exist, for otherwise there would be nothing to bring molecules together into the solid 
and liquid states, and all matter would be gaseous. A study of the forces between atomic or 
molecular species constitutes the subject of intermolecular forces. 

People have speculated about the nature of intermolecular forces ever since the ideas of 
atoms and molecules first existed. Our present ideas, that molecules attract at long range 
but repel strongly at short range, began to emerge in the nineteenth century due to the 
experimental work of Rudolf J.E. Clausius and Thomas Andrews. It was not until the early 
twentieth century, when the principles of quantum mechanics became established, that we 
could truly say that we understood the detailed mechanism of intermolecular forces. 

Although we talk about intermolecular forces, it is more usual and convenient to focus 
on the mutual potential energy, discussed in Chapter 1. If we start with two argon atoms 
at infinite separation, then their mutual potential energy at separation R tells us the energy 
change on bringing the atoms together to that distance from infinity. 

Even for the simplest pair of molecules, the intermolecular mutual potential energy will 
depend on their relative orientations in addition to their separation. Perhaps you can now 
see why the study of intermolecular forces has taken so much effort by so many brilliant 
scientists, over very many years. 


2.1 Pair Potential 

So, to start with, we concern ourselves with two atomic or molecular species A and B, and 
ask how they interact. No chemical reaction is implied, and I should say straightaway that 
I am not going to be concerned with bond making and bond breaking in this chapter. That is 
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the subject of valence theory. In the (unusual) case that the two species A and B concerned 
are ions, you may think that the problem is more or less solved. We simply calculate their 
mutual Coulomb potential energy as discussed in Chapter 1: 


_ 1 QaQb 

471 £(, Rab 


( 2 . 1 ) 


You would certainly be on the right lines in this approach except that ions are not point 
charges and they can be polarized just like any other continuous charge distribution. But as 
I explained in Chapter 1, we rarely have to concern ourselves with ions, and to get started 
we will consider the very simple case of a pair of like atoms (such as two argon atoms). We 
know from experiment that the general form of their mutual potential energy must be that 
shown in Figure 2.1. This curve is meant to be schematic, and I have been careful not to 
quantify the axes. It has all the right features but we have yet to discover its precise form. 
We very often speak about the pair potential, and write it U (R), where R is the separation 
between the two atoms. The zero of U(R), denoted by the horizontal line, is commonly 
taken to refer to the two atoms at infinity. We often characterize the curve in terms of a 
small number of parameters; for example, the collision diameter a being the distance at 
which U ( R ) = 0, the minimum R min and minus the value of U (R) at R min (often written e, 
and as defined is a positive quantity). 



Figure 2.1 Schematic argon-argon interaction 
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We now need to investigate more closely the precise form of this pair potential. The 
potential comprises a repulsive part (important for small R ) and an attractive part (important 
for large R ). It turns out that there are three major contributions to the attractive part, as we 
will see below. 


2.2 Multipole Expansion 

Suppose that we have two molecules with centres a distance R apart (Figure 2.2). The 
distance R is taken to be large compared with a molecular dimension. 



Each molecule consists of a number of charged particles, and in principle we can write 
down an expression for the mutual potential energy of these two molecules in terms of the 
pair potentials between the various point charges. The basic physical idea of the multipole 
expansion is to make use of the fact that several of these particles go to form molecule A, 
and the remainder to form molecule B, each of which has a distinct chemical identity. We 
therefore seek to write the mutual potential energy of A and B in terms of properties of the 
two molecular charge distributions and their separation. 


2.3 Charge-Dipole Interaction 

I can illustrate the ideas by considering an elementary textbook problem, the mutual 
potential energy of a simple small electric dipole and a point charge (Figure 2.3). 

Suppose then that we have a simple dipole consisting of a pair of charges Q A and Q B 
aligned along the horizontal axis and equally separated from the coordinate origin by 
distance d. We introduce a third charge Q as shown, with the scalar distance R from the 
origin. This point charge makes an angle 9 with the electric dipole, as shown. 


24 


Molecular Modelling for Beginners 


Q 



Qa origin Qb 

Figure 2.3 Charge-dipole interaction 


The two point charges Q k and <2b have a mutual potential energy of 

1 QkQb 
47re 0 2d 


( 2 . 2 ) 


but we are going to investigate what happens to the mutual potential energy of the system 
as we change the position vector of Q and so we ignore this term since it remains constant. 

The mutual potential energy f/(charge-dipole) of the point charge and the electric dipole 
is given exactly by 

U (charge-dipole) = ^~Q (j^ + (2-3) 

This can also be written in terms of R and 0 as 

U (charge-dipole) = ——Q ( - H— — | 

47re 0 V V(/? 2 + d 2 + 2dR cos 0) Q{R 2 + d 2 - 2dRcos9) ) 


(2.4) 


and, once again, this is an exact expression. 

In the case where the point charge Q gets progressively far away from the coordinate 
origin, we can usefully expand the two denominators using the binomial theorem to give 

1 / (0 A + (2b) (<2b - <2a) d 

U (charge-dipole) = -- Q (- - + ---cos 9 

4tT£q \ R R 2 

+ «^(W, —1) + .-) (2.5) 

The first term on the right-hand side contains the sum of the two charges making up the 
dipole. Very often, we deal with simple dipoles that carry no overall charge, and this term 
is zero because Q A = —Qb- 

The second term on the right-hand side obviously involves the electric dipole moment, 
whose magnitude is {Qb — Q A )d. The third term involves the electric second moment whose 
magnitude is {Qb + Q\)d 2 , and so on. The mutual potential energy is therefore seen to be 
a sum of terms; each is a product of a moment of the electric charge, and a function of 
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the inverse distance. Hopefully, as R increases, the magnitude of the successive terms will 
become less and eventually the mutual potential energy will be dominated by the first few 
terms in the expansion. 

In the more general case where we replace Q A and Q B with an arbitrary array of point 
charges Qi, Q 2 , ■ ■ ■ ,Q„ whose position vectors are R,, R 2 , . . . , R„ (or for that matter a 
continuous charge distribution), it turns out that we can always write the mutual interaction 
potential with Q as 

U = | (kj ^ - ^22 2' R 'j S rad Q) + M g her order terms J ( 2 - 6 ) 

The first summation on the right-hand side gives the overall charge of the charge distribution, 
the second term involves the electric dipole moment, the third term involves the electric 
quadrupole moment, and so on. 


2.4 Dipole-Dipole Interaction 

Consider now a slightly more realistic model for the interaction of two simple (diatomic) 
molecules (Figure 2.4). Molecule A consists of two point charges, Q 1A and Q 2A . Molecule 
B consists of two point charges <2ib and Q 2B . The overall charge on molecule A is therefore 
Qa = Qia + Q.2A with a similar expression for molecule B. The electric dipole moments of 
A and B are written p A , p B in an obvious notation, and their scalar magnitudes are written 
p A ,p B . The second moments of the two molecules are each determined by a scalar value 
q A and q B , simply because they are linear. 


d?2A 



Figure 2.4 Multipole expansion for a pair of diatomics 

Molecule A is centred at the origin, whilst molecule B has its centre a distance R away 
along the horizontal axis. The inclinations to the axis are 0 A and 0 B , and cj) gives the relative 
orientation of the two molecules. The distance between the two molecules is much less than 
their separation, so we can make convenient approximations such as for the small dipole. 
After some standard analysis we find that the mutual potential energy of A and B is 
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(47T£ 0 ) Uab = + -1 (Q B p A cos 6 A - QaPb cos 6 b ) 

— (2 cos 9 A cos (9 b — sin 0 A sin 0 B cos <p) 

+ (6a<7b (3 cos 2 9 b — l) + Q B q A (3 cos 2 0 A — l)) 4— • (2.7) 

The physical interpretation is as follows. The first term on the right-hand side gives the 
mutual potential energy of the two charged molecules A and B. The second term gives 
a contribution due to each charged molecule with the other dipole. The third term is a 
dipole-dipole contribution, and so on. 

If A and B correspond to uncharged molecules, then the leading term is seen to be the 
dipole-dipole interaction 

(4jr£ 0 ) lVdip...d, P = ( 2 cos cos 08 ~ sin sin 08 cos V) (2.8) 

The sign and magnitude of this term depend critically on the relative orientation of the two 
molecules. Table 2.1 shows three possible examples, all of which have 0 = 0. 


Table 2.1 Representative dipole-dipole terms for two diatomics 


9a 

0 R Relative orientations 

Expression for dipole-dipole U 

0 

0 Parallel 

2p A p B 



4: re 0 /? 3 

0 

n Antiparallel 

2 p A p B 

4 jre 0 R 3 

0 

tt/ 2 Perpendicular 

0 


2.5 Taking Account of the Temperature 

We now imagine that the two molecules undergo thermal motion; we keep their separation 
R constant but allow the angles to vary. The aim is to calculate the average dipole-dipole 
interaction. Some orientations of the two dipoles will be more energetically favoured than 
others and we allow for this by including a Boltzmann factor exp(— UD^T), where k B is 
the Boltzmann constant and T the thermodynamic temperature. It is conventional to denote 
mean values by angle brackets < ... > and the mean value of the dipole-dipole interaction 
is given formally by 


( f^A B )dip...dip 


f t/ " exp (~^) ^ 

/eXp (-^f) dl 


(2.9) 
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The integral has to be done over all possible values of the angles, keeping R fixed. After 
some standard integration, we find 


llj \ 2 P 2 \Pb 1 

3k B r (4 jt£ 0 ) 2 R 6 


( 2 . 10 ) 


The overall value is therefore negative, and the term is inversely dependent on the 
temperature. It also falls off as l/R 6 . 


2.6 Induction Energy 

The next step is the case of two interacting molecules, one of which has a permanent dipole 
moment and one of which is polarizable but does not have a permanent electric dipole 
moment. 

Figure 2.5 shows molecule A with a permanent dipole moment p A . I have indicated the 
direction of p A in the diagram, and an arbitrary point P in molecule B. The dipole p A is 
distant R from point P, and makes an angle 6 as shown. The molecules are sufficiently far 
apart for the precise location of the point P inside the second molecule to be irrelevant. 


Molecule A 



polarizability a B 

Figure 2.5 Dipole-induced dipole 


The basic physical idea is that the electric dipole p A induces a dipole in molecule B, 
since B is polarizable. We evaluate the potential energy involved and finally average over 
all possible geometrical arrangements, for a fixed value of the intermolecular separation. 
The steps involved are as follows. The electrostatic potential due to the small dipole p A is 


<Pa (R) = 


1 PaR 

47T£o R 3 


This is related to the electrostatic field by the general formula 


E (R) = —grad </> (R) 
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and direct differentiation gives the following formula: 


E a (R) = - 


47T£o 


Pa _ PaR 

fl 3 R 5 


R 


( 2 . 11 ) 


Molecule A therefore generates an electrostatic field in the region of molecule B, according 
to the above vector equation. The modulus of this vector at point P is 


Ea = t^-^V(1 + 3cos 2 0) 
47T£ 0 R 3 


( 2 . 12 ) 


This electrostatic field induces a dipole in molecule B. For the sake of argument, I will 
assume that the induced dipole is in the direction of the applied field (and so we need not 
worry about the fact that the polarizability is a tensor property). 

A calculation of the resulting mutual potential energy Uab gives 

Uab = -—=■ ~ ( 3 cos2 6 + !) (2.13) 

(4tt£ 0 ) 2 R 6 2 V ’ 


Polarizabilities are positive quantities and so U AR is negative for all values of 9 at a given 
intermolecular separation. This is quite different from the dipole-dipole interaction, where 
some alignments of the dipoles gave a positive contribution to the mutual potential energy 
and some gave a negative one. 

Finally we have to average over all possible alignments keeping the internuclear separ¬ 
ation fixed. This averaging has again to be done using the Boltzmann weightings, and we 
find eventually an expression for the induction contribution to the mutual potential energy 
of A and B: 


(Uab)^ 


fig As* b 

(4n-£ 0 ) 2 R 6 


(2.14) 


Note that the interaction falls off as 1 IR 6 just as for the dipole-dipole interaction, but 
this time there is no temperature dependence. For two identical A molecules each with 
permanent electric dipole p A and polarizability a A the expression becomes 


(t/AA>md 


jg . 

(4n-£ 0 ) 2 R 6 


(2.15) 


This of course has to be added to the dipole-dipole expression of the previous section. 


2.7 Dispersion Energy 

It is an experimental fact that the inert gases can be liquefied. Atoms do not have permanent 
electric moments, so the dipole-dipole and induction contributions to the mutual potential 
energy of an array of inert gas atoms must both be zero. There is clearly a third interaction 
mechanism (referred to as dispersion), and this was first identified by Fritz W. London in 
1930. 

The two contributions to the mutual potential energy discussed in previous sections 
can be described by classical electromagnetism. There is no need to invoke the concepts 
of quantum mechanics. Dispersion interactions can only be correctly described using the 
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language of quantum mechanics. Nevertheless, the following qualitative discussion is to 
be found in all elementary texts: 

The electrons in an atom or molecule are in continual motion, even in the ground state. So, 
although on average the dipole moment of a spherically symmetrical system is zero, at any 
instant a temporary dipole moment can occur. This temporary dipole can induce a further 
temporary dipole in a neighbouring atom or molecule and, as in the case of the inductive 
interaction, the net effect will be attractive. 


Paul K. L. Drude gave a simple quantum mechanical description of the effect, and his 
theory suggests that the dispersion contribution can be written 



(2.16) 


The first term (that I have written D 6 ) is to be identified with the instantaneous dipole- 
induced dipole mechanism. The higher terms are caused by instantaneous quadrupole- 
induced quadruples, etc. According to Drude’s theory 


In this expression, e, is the first excitation energy of the atomic or molecular species con¬ 
cerned. Once again I apologise for the clash of symbols, for e 0 is of course the permittivity 
of free space. 

The dispersion energy is again seen to be attractive and to fall off as HR 6 . 


2.8 Repulsive Contributions 

When two molecular species approach so closely that their electron clouds overlap, the 
positively charged nuclei become less well shielded by the negative electrons and so the 
two species repel each other. The repulsive term is sometimes written 


= 4 exp (-BR) 


(2.18) 


where A and B are specific to the particular molecular pair and have to determined from 
experiment. The precise form of the repulsive term is not well understood; all that is certain 
is that it must fall off quickly with distance, and the exponential function is therefore a 
possible suitable candidate. 

The total interaction is U = f/ rep + £/di P ... d ip + [4, d + U iisp , which we can write 



(2.19) 
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since all the attractive forces fall off as l/R 6 . This is known as the exp-6 potential. In the 
Lennard-Jones 12-6 potential, we take a repulsive term proportional to MR' 2 and so 


t/ L _, 


C tt _ C 6 
r™~¥ 


( 2 . 20 ) 


Once again the coefficients C 12 and C 6 have to be determined from experiments on the 
species under study. The Lennard-Jones (L-J) potential is usually written in terms of the 
well depth e and the distance of closest approach a as follows: 


t/ L _ j = 4e 


(©“-©*) 


( 2 . 21 ) 


The two L-J parameters o and e have been deduced for a range of atoms. The quantity e/k B 
(which has dimensions of temperature) is usually recorded in the literature rather than e. 
Sample atomic parameters are listed in Table 2.2. 


Table 2.2 Representative L-J atomic parameters 


Atom 

(s/k B y K 

cr/pm 

He 

10.22 

258 

Ne 

35.7 

279 

Ar 

124 

342 

Xe 

229 

406 


Over the years, people have extended these ideas to the interaction of simple molecules. 
Some caution is needed; the interaction between two molecules will generally depend on 
the precise details of their orientation, and the values given in Table 2.3 must be interpreted 
as some kind of geometrical average. 

Figure 2.6 shows a L-J 12-6 plot for argon. 


Table 2.3 L-J parameters for simple molecules 


Molecule 

(e/fe)/K 

cr/pm 

h 2 

33.3 

297 

o 2 

113 

343 

n 2 

91.5 

368 

ci 2 

357 

412 

Br 2 

520 

427 

co 2 

190 

400 

ch 4 

137 

382 

CCI 4 

327 

588 

C 2 H 4 

205 

423 

c 6 h 6 

440 

527 


Source: Hirschfelder et al. (1954). 
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Figure 2.6 Lennard-Jones 12-6 potential forAr-Ar. The plot is ofe/k B against R. 


2.9 Combination Rules 

A large number of L-J parameters have been deduced over the years, but they relate to pairs 
of like atoms. Rather than try to deduce corresponding parameters for unlike pairs, it is usual 
to use so-called combination rules, which enable us to relate the C I2 and C 6 parameters 
for an unlike-atom pair A-B to those of A-A and B-B. The use of such combination rules 
is common in subjects like chemical engineering, and is widely applied to many physical 
properties. 

There are three common combination rules in the literature. The first is 




( 2 . 22 ) 


where R* is the minimum energy separation for two atoms of type i and e, the well depth. 
The second is 
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Ci2,ij = 4 

C 6 ,ij = 4 (fr,o)) 3 (2.23) 

Finally 



C 12 , & = ^C 6 ,,(/? i+ ^) 6 (2.24) 

where a, is the dipole polarizability of atom i, N t the number of valence electrons and R, 
the van der Waals radius. 


2.10 Comparison with Experiment 

You will have got the idea by now that we have to determine the parameters in any pair 
potential by appeal to experiment. There are two kinds of experiment to consider. First 
there are those that are essentially in the gas phase, where pairs of atoms genuinely interact 
with each other unencumbered by other species. This means that the total mutual potential 
energy is given by the sum of the interacting pairs. Second there are experiments that 
essentially relate to condensed phases, where the interacting particles are sufficiently close 
to raise doubts about the credibility of the pair wise additivity assumption. 


2.10.1 Gas Imperfections 


The deviation of gases from perfect behaviour can be expressed in the form of a virial 
equation of state 


nB(T) n 2 C (T) 
nRT + V + V 2 + 


(2.25) 


where the virial coefficients B(T), C(T), . . . depend on the temperature and on the char¬ 
acteristics of the species under study. Here, n is the amount of substance, p the pressure, V 
the volume, R the gas constant and T the thermodynamic temperature. B(T) is called the 
second virial coefficient, C(T) is called the third virial coefficient, and so on. They have to 
be determined experimentally by fitting the pVT data of the gas under study. 

The virial equation of state has a special significance in that the virial coefficients can 
be related directly to molecular properties. B{T) depends on the pair potential U(R) in the 
following way: 

B (T) =2n J (l - exp (“ ^)) dR (2.26) 


2.10.2 Molecular Beams 

In a molecular beam experiment, a beam of mono-energetic molecules is produced and 
allowed to collide either with other molecules in a scattering chamber, or with a similar 
beam travelling at right angles to the original beam. Measurements of the amount by which 
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the incident beam is reduced in intensity, or the number of molecules scattered in a particular 
direction allow determination of the parameters in the pair potential. 


2.11 Improved Pair Potentials 


The L-J 12-6 potential for a pair of interacting atoms 


E/l-, (R) = 


C ] 2 

RU 


C 6 

R* 


-<■((£)■-G)‘) 


contains two parameters (e and a , or C n and C 6 ) that have to be determined by experimental 
observation. The exp-6 model 

U (R) = Aexp(-BR) - ^ 

contains three parameters, which allows for a little more flexibility. 

The Bom-Mayer-Huggins potential 


U (R) = A exp (-BK) - ^ ^ (2.27) 

contains four parameters. 

More recent investigations have concentrated on pair potentials having many more 
disposable parameters, for example 


U (r) = exp (All 




(-©I H)‘) HD") 


There is no obvious relationship between the various parameters in these different mod¬ 
els; they all have to be determined by fitting experimental data. Roughly speaking, the more 
parameters there are the better. 


2.12 A Numerical Potential 

An alternative approach is to abandon all attempts at a precise functional form and try to 
find a numerically tabulated potential by direct comparison between theory and experiment. 
In their famous 1969 paper, which is also a good reference for historically important pah- 
potentials, Dymond and Alder (1969) say 
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A study of the effect of variations in the interatomic potential of argon on the calculated 
properties of the dilute gas has led to a numerically tabulated potential which satisfactorily 
agrees with all available equilibrium and transport data. This potential differs significantly 
from previously used pair potentials. 

Their approach makes use of the second virial coefficient mentioned above. 


2.13 Site-Site Potentials 

The L-J potential has played an important role in the history of molecular modelling. Early 
work focused on atoms, but, as I explained, there were many ambitions attempts to model 
simple molecules as if they were in some way L-J atoms, and the parameters have to be 
interpreted as applying to some kind of average over molecular rotations (and presumably 
vibrations). 

Suppose now that we want to try and understand the interaction between two dinitrogen 
molecules in more depth. In view of our discussion above, the instantaneous interaction 
energy will clearly depend on the separation of the two diatoms, together with their mutual 
angular arrangement in space. 

Figure 2.7 shows two such dinitrogens, oriented arbitrarily in space with respect to each 
other. Nitrogen A and nitrogen B make up a stable diatom, as do atoms C and D. We ignore 
the fact that the molecules have vibrational energy, and the two diatoms are taken to be 
rigid. As a first approximation, the mutual potential energy of the pair of diatoms could be 
calculated by adding together the appropriate L-J parameters. If I write as shorthand the 
L-J interaction between atoms A and C as 



then the total interaction between the two diatomics is taken 


U L _ S = U L _ j (A, C) + t/ L _j (A, D) + f/ L _j (B, C) + t/ L _j (B, D) (2.29) 
Such a potential is called a site-site potential. We will meet such potentials later in the text. 



A 


"V 


D 


Figure 2.7 Two dinitrogen molecules, arbitrarily oriented 
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2.14 Problem Set 

A problem set, with solutions, is available on the associated website. 
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Balls on Springs 


The theory of intermolecular forces relates to atomic and/or molecular species that are some 
distance apart (say, a few bond lengths). We saw in Chapter 2 that progress can be made in 
such a theory without normally invoking the concepts of quantum mechanics. If we want 
to truly understand why two atoms combine together to give a chemical bond, and how 
bonds get broken and reformed in chemical reactions, then we enter the realms of valence 
theory. Quantum mechanics plays a dominant part in such discussions. 

These are simple-minded comments and my arbitrary division of molecular interactions 
is subjective. At first sight, the stability of an NaCl ion pair can be explained in terms 
of elementary electrostatics, and we can usefully model argon liquid without recourse to 
quantum mechanics (apart from the London dispersion potential, which is a ‘pure’ quantum 
mechanical effect). AC-C bond in ethane is at first sight a quantum mechanical animal, and 
we will certainly have to invoke quantum mechanical ideas to explain the reaction of ethene 
with dichlorine. But there are grey areas that I can bring to your attention by considering the 
phenomenon of hydrogen bonding. The hydrogen bond is an attractive interaction between 
a proton donor X-H and a proton acceptor Y in the same or a different molecule 


X-H...Y 

The bond is usually symbolized by three dots as shown above, in order to reconcile the 
existence of compounds such as 

NH 3 ... HC1 

with the trivalence of nitrogen, the divalence of oxygen in oxonium salts and other com¬ 
pounds that apparently break the classical valence rules. Hydrogen bonds typically have 
strengths of 10-100 kJ mol" 1 . The lone pairs of oxygen and nitrogen and the partially 
charged character of the proton were eventually recognized as the sources of this bond. The 
first reference to this ‘weak bond’ was made by Latimer and Rodebush (1920). 
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The individual monomers X-H and Y retain their chemical identity to a large extent on 
hydrogen bond formation; in other words no new covalent bond gets made. A great deal 
of evidence suggests that simple electrostatic models of the hydrogen bond give perfectly 
acceptable quantitative descriptions of the structure, vibrations and electric dipole moments 
of such hydrogen bonded species. 

The hydrogen-bonded species 


FHF“ 


is well known and has been well studied, but it cannot be written 


F-H...F- 


because the proton is equally shared between the two fluorine atoms. Such a species is best 
thought of as covalently bound, and has to be treated by the methods of molecular quantum 
theory. 

Having warned about bond breaking and bond making, I should tell you that a great deal 
of molecular modelling is concerned with the prediction and rationalization of molecular 
bond lengths and bond angles. Here we usually deal with isolated molecules in the gas phase 
and the theoretical treatments often refer to 0 K. A surprising amount of progress can be 
made by treating molecules as structureless balls (atoms) held together with springs (bonds). 
The array of balls and springs is then treated according to the laws of classical mechanics. 
Such calculations are remarkably accurate, and are taken very seriously. 


3.1 Vibrational Motion 

To get started, consider a particle of mass m lying on a frictionless horizontal table, and 
attached to the wall by a spring as shown in Figure 3.1. 



Table 


Figure 3.1 Ball attached to wall by spring 


The particle is initially at rest, when the length of the spring is R e (where the subscript ‘e’ 
stands for equilibrium). If we stretch the spring, it exerts a restoring force on the particle 
whilst if we compress the spring there is also a force that acts to restore the particle to its 
equilibrium position. If R denotes the length of the spring, then the extension is R - R e and 
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if F s is the restoring force due to the spring, it is often observed experimentally that the 
force is directly proportional to the extension: 

F s = -k s (R-R t ) (3.1) 


The constant of proportionality k s is called the force constant and it tells us the strength of 
the spring. This law is known as Hooke’s low and it applies to very many springs made 
from many different materials. It invariably fails for large values of the extension, but is 
good for small deviations from equilibrium. 

Suppose that we now set the particle in motion, so that it oscillates about R t . According 
to Newton’s second law we have 

d 2 R 

m— = -K (R - tfe) (3.2) 

df 2 

This second-order differential equation has the general solution 


R = R e + A sin 




(3.3) 


where A and B are constants of integration. These constants have to be fixed by taking 
account of the boundary conditions. For example, if the particle starts its motion at time 
t = 0 from R — R e then we have 


R t = R t + A sin 




from which we deduce that B = 0 for this particular case. Normally we have to find A and 
B by a similar procedure. 

The trigonometric functions sine and cosine repeat every 2 jt and a little manipulation 
shows that the general solution of Equation (3.3) can also be written 



The quantity yfkjm has the dimension of inverse time and it is obviously an import¬ 
ant quantity. We therefore give it a special symbol (co) and name (the angular vibration 
frequency). We often write the general solution as 

R — R c + A sin (cot) + B cos (rot) (3.4) 

A typical solution is shown as Figure 3.2 (for which I took A = lm,B = 0,m — 1 kg and 
k s — 1 N m -1 ). Such motions are called simple harmonic. At any given time, the displace¬ 
ment of the particle from its equilibrium position may be nonzero, but it should be clear 
from Figure 3.2 that the average value of the displacement R-R t is zero. As noted earlier, 
it is usual to denote average values by angle brackets < ... > and so we write 


{R -R e ) = 0 
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t/s 


Figure 3.2 Simple harmonic motion 


It should also be clear from Figure 3.3 that the average value of (R - R c ) 2 is not zero. 
A direct calculation using < sin 2 (ait) > = l / 2 gives 

((R-R e f) = 1 -{A 2 +B 2 ) (3.5) 



t/s 


Figure 3.3 Variation of (R - R,.) 2 with time 
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3.2 The Force Law 


If we use the general one-dimensional result that links force and mutual potential energy 


U(x) = - 


/ 


F(x) dx 


we find 

U(R) = U(R e ) + C R ~ Rtf (3.6) 

where U(R e ) is the value of U (R) at the equilibrium position. As mentioned earlier, we 
often set this constant of integration arbitrarily to zero and so 

U ( R ) = l -k s {R - R e ) 2 (3.7) 


Because the motion is simple harmonic, we refer to this potential as a harmonic potential. 
The potential energy varies over each cycle, in the same way as shown in Figure 3.3. The 
average value of the potential energy over a cycle is 

(U(R)) = \k s [(R-R e ) 2 ) 

= \{A 2 + B 2 ) (3.8) 

Finally, the kinetic energy T is given by 


The average value is 

{T(K))= \mco 2 (A 2 +B 2 ) 
— ( A 2 + B~) 


(3.9) 


The kinetic and potential energies vary with time, but the total energy U(R) + T(R) 
is constant: it does not vary with time. The average value of the kinetic energy over a 
cycle is equal to the average value of the potential energy, each of which is one half of 
the total energy. I am going to use the symbol e for energy when referring to a single 
atom or molecule, throughout the text. The total energy e can take any value with no 
restrictions. 


3.3 A Simple Diatomic 

Consider next the Hooke’s law model of a diatomic molecule (Figure 3.4). The atoms have 
masses mi and m 2 , and the spring has a harmonic force constant k s . 

I am going to consider the motion along the horizontal axis, which I will refer to as 
the x axis. The x coordinates of the two atoms are x, and x 2 relative to an arbitrary axis. 
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™i m 2 

Figure 3.4 Diatomic molecule 


the equilibrium length of the spring is R e and the length of the extended spring at some 
given time is 

R= x 2 — x i 

The spring extension is therefore 

X 2 — Xi — Re 


Considering atom 1, the spring exerts a force of k s (x 2 — x t — R c ) and so, according to 
Newton’s second law 


d 2 x t 
mi d t 2 


-Rc) 


= k s (R — R t ) 


(3.10) 


As far as atom 2 is concerned, the extended spring exerts a force of magnitude k s (x 2 —x, — R c ) 
in the direction of decreasing x 2 and so 

d 2 x 2 

w 2 — = -k s (x 2 - xi - R e ) 

= -k s (R - Re) (3.11) 


After a little rearrangement we find 


d 2 R 
df^ : 


K 

m 2 


{R - Rz) 


K 

-(R- RJ 


= -K 


(R-Rc) 


We now define a quantity ji, called the reduced mass, by 
i _ i 1 

/u mi m 2 


and so we have 


d 2 R 

= -K (R - Rc) 

d t- 


(3.12) 


(3.13) 


which is identical to Equation (3.2) already derived for a single particle of mass /x on 
a spring. 

The general solution is therefore 


R — Re + A sin 



(3.14) 


o = s/K/d- 


and the angular frequency is 
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It is easy to demonstrate that the potential energy is 


U = -k s (x 2 -x, - R e ) 7 


and the total energy e vib of the harmonically vibrating diatomic is therefore 



m 



(3.15) 


3.4 Three Problems 

This simple treatment suggests three problems. First, how do we determine the spring 
constant for a simple molecule such as 1 H 35 C1 or 12 C ls O? Second, how good is the harmonic 
approximation? Third, have we missed anything by trying to treat a molecular species as 
if it obeyed the laws of classical mechanics rather than quantum mechanics? 

The three questions are interlinked, but let me start with the third one. The experimental 
evidence suggests that we have made a serious error in neglecting the quantum mechanical 
details. If we irradiate a gaseous sample of 1 H 35 C1 with infrared radiation, it is observed 
that the molecules strongly absorb radiation of wavenumber 2886 cm -1 . With hindsight we 
would of course explain the observation by saying that the molecular vibrational energies 
are quantized. A major flaw of the classical treatment is that the total vibrational energy is 
completely unrestricted and quantization does not arise. 

The quantum mechanical treatment of a harmonically vibrating diatomic molecule 
is given in all the elementary chemistry texts. The results are quite different from the 
classical ones. 

1. The vibrational energy cannot take arbitrary values, it is quantized. 

2. There is a single quantum number v which takes values 0, 1,2,... This is called the 
vibrational quantum number. 

3. Vibrational energies e vib are given by 



where h is Planck’s constant. 

The results are usually summarized on an energy level diagram, such as that shown 
in Figure 3.5. I have just drawn the first four vibrational energy levels, but there are an 
infinite number of them. According to the harmonic model, the spacing between the levels 
is constant. 

A careful investigation into the mechanism by which electromagnetic radiation interacts 
with matter suggests that transitions between these vibrational energy levels are allowed, 
provided the vibrational quantum number changes by just 1 unit. So molecules with v = 0 
can absorb radiation of exactly the right energy for promotion to v = 1. Molecules with 
v = 1 can either absorb radiation with exactly the right energy for promotion to v = 2 or 
they can emit radiation and fall to v = 0, and so on. 
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v = 3 


v = 2 


v = 0 


Figure 3.5 Vibrational energy levels 


According to the quantum model, then, molecules can only have certain vibrational 
energies and this behaviour is totally at variance with the classical treatment. 

Also, the quantum treatment differs from the classical treatment in that the lowest energy 
is that with v = 0, where the energy is nonzero. This is called the zero-point energy. Accord¬ 
ing to the classical treatment, a molecule can be completely at rest. According to the quantum 
treatment, the lowest vibrational energy allowed is the zero-point energy. 

How do we measure the spring constant? According to the harmonic quantum model, 
the energy difference between any consecutive pair of energy levels is given by 



(3.16) 


so all we need to do is to measure this energy difference experimentally. The reduced mass p 
of the 1 H 35 C1 isotopic species is 1.6267 x 10 -27 kg and substitution of the experimental value 
(2886 cm -1 ) into the energy difference gives the harmonic force constant as 480.7 N m -1 . 

In fact, there is more to the experiment than I have told you. Spectroscopic experiments 
are done at finite temperatures and a given sample of N molecules may have many energy 
levels populated. Relative populations N v are given by the Boltzmann formula 



(3.17) 


Substitution of values into the formula shows that for many everyday diatomic molecules 
at everyday temperatures, the only vibrational level populated is that with v = 0. So an 
infrared absorption spectrum should just show a single absorption, corresponding to the 
transition v = 0 to v = 1. 

A closer examination of the 1 H 35 C1 spectrum shows weak absorptions at 5668, 
8347,... cm -1 , which are nearly (but not exactly) two and three times the fundamental 
vibration frequency. 

The existence of these lines in the spectrum shows that our assumption of Hooke’s law is 
not completely correct. Figure 3.6 shows the ‘experimental’ energy level diagram compared 
to the harmonic one. 
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Harmonic Experimental 


Figure 3.6 Harmonic versus experimental energy levels 


Our conclusion from the experimental data is that vibrational energy levels get pro¬ 
gressively closer together as the quantum number increases. This suggests that whilst the 
harmonic model is a reasonable one, we need to look more carefully at the form of the 
potential in order to get better agreement with experiment. 


3.5 The Morse Potential 


Professional spectroscopists would be unhappy with the idea of using Hooke’s law as a 
model for the vibrational motion. They would be more concerned with matching their 
experimental energy levels to a more accurate potential. Many such potentials have been 
used over the years, with that due to Morse being widely quoted in elementary chemistry 
texts. The Morse potential is as follows: 

U = D e {1 — exp [—/3(/? — /? e )]} 2 (3.18) 


where D e is the depth of the potential well, i.e. the thermodynamic dissociation energy, and 


/» = 



\| 2D t 


We 

2 Y D t 


(3.19) 


This potential contains three parameters D t , w e and R e , and so should be capable of giving a 
better representation to the potential energy curve than the simple harmonic, which contains 
just the two parameters k s and R e . 

In the case of 1 H 35 C1, a simple calculation shows that the dissociation energy 
1 


D e = D 0 - 1 —h ( 2.71(0 e ) 


(3.20) 
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is 4.430 + 0.186eV = 4.616 eV. The Morse potential for 1 H 35 C1 is shown in Figure 3.7, 
compared to the simple harmonic model. The full curve is the simple harmonic potential, 
the dashed curve the Morse potential. 



Figure 3.7 Simple harmonic and Morse curves for HCI 


3.6 More Advanced Potentials 

More often than not, the following spectroscopic constants are available for a diatomic 
molecule: 

R e the equilibrium internuclear separation; 

D e the dissociation energy; 
k s the force constant; 

ftj e x c the anharmonicity constant (sometimes written x e only); 
a c the vibration-rotation coupling constant. 

Usually these five constants can be found to good experimental accuracy. 

There are a number of three- to five-parameter potential functions in the literature, of 
which the Morse potential is the most popular. A typical five-parameter potential is the 
Linnett function (Linnett 1940, 1942) 

U(x) = ^ m ~ bexp(-tx)[\ + gi(x - 1) + g 2 (x - l) 2 ] 
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The five parameters K, b, t, gi and g 2 can be related to the molecular constants. At this point 
I should explain how we recover the force constant from such a complicated expression, 
and to do this I use a Taylor expansion of the potential about the equilibrium bond length: 


/ dU\ 1 , / d 2 U\ 

um = u (* e ) + (*-*) (^ + 5 CR - *)’ ■ (3-21) 


U(R) is obviously equal to U(R c ) when R — R e , and this fixes the constant of integration. 
The equation is sometimes written as 


mm-u m = (*-*.> (f) ^ + \ <* - «.) ! 

1 


where it is understood that U(R) is measured relative to the potential energy minimum (that 
is to say, we take the zero as U(R e )). 

The quantity dU/dR is of course the gradient of U. The second derivative evaluated at 
the minimum where R = R e is called the (harmonic) force constant. To find R c we solve 
the equation dU/dR = 0 and substitute this value into the second derivative to evaluate 
the force constant. In the special case of a harmonic potential, the second derivative is a 
constant and is equal to the force constant. 
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4 

Molecular Mechanics 


In Chapter 3,1 showed you how to use classical mechanics to model the vibrational motion 
of a diatomic molecule. I also explained the shortcomings of this treatment, and hinted at 
applications where a quantum mechanical model would be more appropriate. We will deal 
specifically with quantum mechanical models in later chapters. 


4.1 More About Balls on Springs 

It is time to move on to more complicated molecules. I want to start the discussion by 
considering the arrangement of balls on springs shown in Figure 4.1. 

We assume that the springs each satisfy Hooke’s law. I will call the spring constant of 
the left-hand spring k x and the spring constant of the right-hand spring k 2 . The equilibrium 
position corresponds to the two masses having x coordinates R lfi and R 2fi , and we constrain 
the motion so that the springs can only move along the x axis. The particle masses are 
shown in Figure 4.1. 



Figure 4.1 Two balls, two springs 
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We then stretch the system so extending the two springs, and I will call the instantaneous 
positions of the two masses x t and x 2 . The extensions of the springs from their equilibrium 
positions are 


ft = *1 - *l,e 
ft = *2 - *2,e 


Consider the left-hand spring; it exerts a restoring force on particle 1 of—ft §\ . Now consider 
the right-hand spring. This spring is stretched by an amount ($ 2 - ft), and so it exerts a 
force of ft (ft — ft); this force acts to the left on particle 2 and to the right on particle 1. 
Application of Newton’s second law gives 



(4.1) 


There are many different solutions to these simultaneous differential equations, but it proves 
possible to find two particularly simple ones called normal modes of vibration. These 
have the property that both particles execute simple harmonic motion at the same angular 
frequency. Not only that, every possible vibrational motion of the two particles can be 
described as linear combinations of the normal modes. 

Having said that it proves possible to find such solutions where both particles vibrate 
with the same angular frequency w, let me assume that there exist such solutions to the 
equations of motion such that 


ft (t) = A sin(cut + (fit ) 
ft(f) = B sin (cot + <p 2 ) 


A, B, <pi and (p 2 are constants that have to be determined from the boundary conditions. 
Differentiating these two equations with respect to time gives 



and substituting these expressions into the equations of motion gives 




(4.2) 


These two equations are simultaneously valid only when co has one of two possible values 
called the normal mode angular frequencies. In either case, both particles oscillate with the 
same angular frequency. 
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In order to investigate these normal modes of vibration, I write the above equations in 
matrix form, and then find the eigenvalues and eigenvectors as follows: 



(4.3) 


Matrix diagonalization gives the allowed values of co 1 (the eigenvalues), and for each value 
of —co 2 we calculate the relevant combinations of the £ values (the eigenvectors). The 
eigenvectors of the matrix are called the normal coordinates. 


4.2 Larger Systems of Balls on Springs 

For a molecule comprising N atoms, there are 3 N Cartesian coordinates. Of these, three 
can be associated with the position of the centre of mass of the whole molecule and three 
for the orientation of the molecule at the centre of mass (two for linear molecules). This 
leaves 3N-6 vibrational degrees of freedom (3 N - 5 if the molecule is Unear), and it is 
appropriate to generalize some concepts at this point. I am going to use matrix notation, in 
order to make the equations look friendlier. 

The molecular potential energy U will depend on p = 3N — 6 (independent) variables. 
For the minute, let me call them q u q 2 , ..., q p , and let me also write q he , q 2fi ,..., q pfi for 
their ‘equilibrium’ values. These coordinates are often referred to as internal coordinates, 
and they will be linear combinations of the Cartesian coordinates. 

First of all, for the sake of neatness, I will collect all the <y’s into a column matrix q. I will 
also collect together the ‘equilibrium’ values into a column matrix q e and the extensions 
into a column matrix f: 



(4.4) 


I will now write t/(q) to indicate the dependence of U on these variables. If I use Taylor’s 
theorem to expand U (q) about point q e then the one-dimensional equation 




+ 2^-^e) 2 


(given previously in Chapter 3) has to be modified to take account of the larger number of 
variables. First derivatives become partial first derivatives, and we have to take account of 
the ‘mixed’ second-order derivatives: 


t/(q)-[/(q e ) = 



(4.5) 
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In ordinary vector differentiation, we meet the gradient of a scalar field /, defined in 
Cartesian coordinates as 

, 9/ 9/ 9/ 

grad/ = —e, + —e, + —e, 

where t x , e v and e, are Cartesian unit vectors. When deahng with functions of many variables 
it proves useful to make a generalization and write the gradient of (for example) U as 

/ dU\ 

dqi 
3U 
dq 2 


V dq p / 

so grad U is a column matrix that stores all the partial derivatives. This ‘vector’ will occur 
many times through the text, and I am going to give it the symbol g (for gradient). 

The second derivatives can be collected into a symmetric p x p matrix that is called the 
Hessian of V and I will give this the symbol H. In the case where p = 3, we have 



d 2 U 

d 2 u 

d 2 U 


3q\ 

dq 3 dq 2 

dq\dq 3 


d 2 U 

d 2 U 

d 2 U 


dq 2 dqi 

dqi 

dq 2 dq 3 


d 2 U 

d 2 U 

d 2 u 

V 

dq 3 dq l 

dq 3 dq 2 

dqi 


The Taylor expansion then becomes 

U (q) - U (q e ) = fg + + ■ • • (4.8) 

Both the gradient and the Hessian have to be evaluated at the point q e , and so you will 
sometimes see the equation written with an ‘e’ subscript: 

t/(q) - C/(q e ) = f T g e + y r H e £ + • ■ ■ 

The superscript T as in £ T indicates the transpose of a matrix; the transpose of a column 
matrix is a row matrix. The Hessian is often referred to as the force constant matrix. 

Finally, if I denote the 3N Cartesian coordinates X u X 2 ,... ,X 3N , we usually write the 
transformation from Cartesian coordinates to internal coordinates as 


q = BX 


(4.9) 


where the rectangular matrix B is called the Wilson B-matrix. The B matrix has p rows and 
3 N columns. 
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4.3 Force Fields 

I have been vague so far about which variables are the ‘correct’ ones to take. Chemists 
visualize molecules in terms of bond lengths, bond angles and dihedral angles, yet this 
information is also contained in the set of Cartesian coordinates for the constituent atoms. 
Both are therefore ‘correct’; it is largely a matter of personal choice and professional 
training. I should mention that there are only 3N-6 vibrational coordinates, and so one has 
to treat the 3N Cartesian coordinates with a little care; they contain three translational and 
three rotational degrees of freedom. More of this technical point will be discussed later. 

Spectroscopists are usually interested in finding a set of equilibrium geometric paramet¬ 
ers and force constants that give an exact fit with their experimental data. This is harder than 
it sounds, because for a molecule comprising N atoms and hence p = 3N — 6 vibrational 
degrees of freedom, there are 'kpip — 1) force constants (diagonal and off-diagonal). In 
order to measure the individual force constants, the spectroscopist usually has to make 
experimental measurements on all possible isotopically labelled species. It usually turns 
out that there are many more unknowns than pieces of experimental information. Spectro¬ 
scopists usually want a force field (comprising force constants, equilibrium quantities and 
every other included parameter) that is specific for a given molecule. They want to match 
up ‘theory’ with their incredibly accurate measurements. 

Many of the ‘off-diagonal’ force constants turn out to be small, and spectroscopists have 
developed systematic simplifications to the force fields in order to make as many as possible 
of the small terms vanish. If the force field contains only ‘chemical’ terms such as bond 
lengths, bond angles and dihedral angles, it is referred to as a valence force field (VFF). 
There are other types of force field in the literature, intermediate between the VFF and the 
general force field discussed above. 


4.4 Molecular Mechanics (MM) 

Molecular modellers usually have a quite different objective; they want a force field that can 
be transferred from molecule to molecule, in order to predict (for example) the geometry of 
a new molecule by using data derived from other related molecules. They make use of the 
bond concept, and appeal to traditional chemists’ ideas that a molecule comprises a sum 
of bonded atoms; a large molecule consists of the same features we know about in small 
molecules, but combined together in different ways. 

The term molecular mechanics was coined in the 1970s to describe the application of 
classical mechanics to determinations of molecular equilibrium structures. The method 
was previously known by at least two different names: the Westheimer method and the 
force-field method. The name and acronym MM are now firmly established. 

The idea of treating molecules as balls joined by springs can be traced back to the 
work of Andrews (1930). A key study on the development of MM was that by Snyder and 
Schachtschneider (1965), who showed that transferable force constants could be obtained 
for alkanes provided that a few off-diagonal terms were retained. These authors found that 
off-diagonal terms are usually largest when neighbouring atoms are involved, and so we 
have to take account of nonbonded interactions but only between next nearest neighbours. 
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A final point for consideration is that we must also take account of the chemical envir¬ 
onment of a given atom. An sp carbon atom is different from an sp 2 carbon atom and so 
on. It is traditional to speak of atom types in MM. 

Our idea is to treat the force field as a set of constants that have to be fixed by appeal to 
experiment or more rigorous calculation. In MM we take account of nonbonded interactions, 
and also the chemical sense of each atom. A valence force field that contains nonbonded 
interactions is often referred to as a Urey-Bradley force field. 


4.4.1 Bond Stretching 

If we consider phenylanine (the structure of which is shown in Figure 4.2) we can identify 
a variety of bond types including C(sp 2 ) — C(sp 2 ), C(sp 2 ) — C(sp 3 ), O — H, C = O, and so 
on. If we assume that Hooke’s law is adequate, then each bond stretch between atom types 
A and B makes a contribution to the total molecular potential energy of 

U AB = ] -k A n (Rab ~ Rz, ab ) 2 (4.10) 

in an obvious notation. Parameter Icab is the force constant, R Aft the instantaneous bond 
length and R CiA b the equilibrium bond length. 


O 


NH 2 



Phenylanine 

Figure 4.2 Structure of phenylanine 


Other scientists recommend the Morse potential: 

Uab = D{1- exp[—«(*« - R^)]} 2 

whilst some recommend the addition of extra terms to the simple harmonic expression 

U A b = k, (R a b - R cAB ) 2 + k 2 (R a b - R cAR ) 4 (4.11) 

4.4.2 Bond Bending 

Next we have to consider the bond bending vibrations. It is usual to write these as harmonic 
ones, typically for the connected atoms A-B-C 

I 


(4.12) 
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where k is the force constant, and the subscript ‘e’ refers to the equilibrium value where 
the molecule is at rest. A variation on the theme is given by 

(cos 0 abc — cos #ABC,e) 2 (4.13) 


2 Sin 0ABC,e 


4.4.3 Dihedral Motions 

Next we must consider the dihedral angle ABCD between the four bonded atoms A, B, C 
and D (Figure 4.3). 



Some authors divide these into proper dihedrals, where we might expect full rotation 
about the connecting bond B-C, and improper dihedrals where the rotation is limited. For 
example, if C-D were a C-H fragment of a methyl group we would be expect full rotation 
about B-C and a three-fold symmetry in the potential energy term. A-CFI-CH- linkage in 
a benzene ring would only show a moderate flexing from its planar value (angle zero). 

If we use x to denote the ABCD angle, then a popular dihedral potential is given by 

-cos (/!(*-*.))) (4.14) 

Here n is the periodicity parameter, which would be 3 for a methyl group. Angle Xc is the 
equilibrium torsional angle. A more complicated example is given by 

Vi V 2 y 3 

U = — (1 + cos (jhx ~ gi)) + Jr (f + cos (n 2 x ~ 82 )) + — (1 + cos (n 3 x - g 3 )) 

The V’s are energy terms, the ns are periodicity parameters, the g’s are phase parameters 
and N d is a constant that depends on the number of bonds. 

Some authors treat improper dihedrals in the same way as bond bending, and take a 
contribution to the molecular potential energy as 

U ABCD = i*ABGD (XABCD - ^ABCd)' (4.15) 

where x is the dihedral angle, as above. 
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4.4.4 Out-of-Plane Angle Potential (Inversion) 

Next we consider the out-of-plane potential terms. Imagine molecule ABCD in Figure 4.4 
to be ammonia, a molecule with a very low barrier to inversion; as i// changes from positive 
to negative, the molecule inverts. We can write the inversion potential either in terms of 
the angle indicated (i/r) or in terms of the height (h) of atom D above the plane of atoms A, 
B and C. Four examples are as follows: 

ki , 

U = -^— (cos 1 jr — COS l/r e ) 

2 sin i/'e 

U = k 2 h 2 (4.16) 

U = k 3 (1 + k A cos(m jr)) 

U = k s (1 + cos(m jr — k 6 )) 

where n is a periodicity parameter, the k ’s are constants that have to be fixed against 
experiment and \js e is the equilibrium value. 



Figure 4.4 Out-of-plane (inversion) potential 


4.4.5 Nonbonded Interactions 

I mentioned earlier that molecular mechanics force fields have to be transferable from 
molecule to molecule, and explained the necessity for nonbonded interactions. These are 
usually taken to be Lennard-Jones 12-6 type, and they are included between all nonbonded 
pairs of atoms: 



The Born-Mayer-Huggins potential 

i/,„=Aexp(-SR,-S_^ 

is sometimes used when dealing with polar species. B is a parameter determined by the 
size and ‘softness’ of an ion, C' (not the same as C 6 in U L _ j) has to do with dipole-dipole 
interactions whilst C 8 is determined by dipole-quadrupole interactions. 

Some force fields make special provision for hydrogen-bonded atoms; they treat them 
as nonbonded interactions but soften the Lennard-Jones 12-6 potential for A-H... B to a 
12-10 version: 
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U w 


C|2 C,o 


(4.17) 


Other authors take the view that hydrogen bonds are perfectly respectable chemical bonds 
that should be treated just like any other bond. They are therefore given a force constant 
and so on. 


4.4.6 Coulomb Interactions 


Many force fields take account of electronegativity differences between atoms and add 
electrostatic terms. Atomic charges Q A and <2b are assigned to atoms A and B according to 
the rules of the particular force field, and we write 


1 Q a < 

4jre 0 R 


4.5 Modelling the Solvent 

I should remind you that the electrostatic expression above relates only to point charges 
in free space. In the presence of a dielectric material (such as water), the force between 
point charges is reduced by a factor e r called the relative permittivity. Many force fields 
were developed at a time when it was not feasible to include a solvent explicitly in such 
calculations. 

Various attempts were made to allow for the effect of a solvent; the most obvious thing to 
do is to alter the relative permittivity even though no solvent molecules are actually taken 
into account. There is no agreement between authors as to the correct value of e r and values 
ranging between 1 and 80 have been used for water. Some force fields take e r proportional 
to the distance between the point charges. I will explain some more up-to-date ways of 
modelling the solvent in later chapters. 


4.6 Time- and Money-Saving Tricks 

All the contributions to the molecular potential energy U given above can be done on a 
pocket calculator. The larger the molecular system, the larger the number of individual 
contributions to U and the relationship between molecular size and computational effort 
is roughly dependent on the square of the number of atoms. Over the years, people have 
tried to reduce the computational time for a given problem by the use of various tricks of 
the trade. Two such methods are as follows. 

4.6.1 United Atoms 

Some professional force fields use the so-called united atom approach. Here, we regard 
(for example) a CH 3 group as a pseudo-atom, X, and develop parameters for a C(sp 2 )-X 
stretch, and so on. It is customary to treat methyl, methylene and methane groups as united 
atoms, especially when dealing with large biological systems. 
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4.6.2 Cut-Offs 

For a large molecule, there are many more nonbonded interactions than bonded interactions. 
Molecular mechanics force fields very often cut these off to zero at some finite distance, in 
order to save computer time. This can sometimes lead to mathematical difficulties because 
of the discontinuity, and various ingenious methods have been proposed to circumvent the 
problem (other than actually retaining the terms). I will show you in a later chapter that 
there are other problems associated with this cut-off procedure; it is a real problem, not just 
one that I have mentioned out of historical interest. 

Figure 4.5 shows a Lennard-Jones 12-6 potential with a cut-off (after a certain value of 
R, the potential is set to zero). 



Intermolecular distance R 


Figure 4.5 Schematic cut-off of Lennard-Jones potential 


4.7 Modern Force Fields 


A ‘standard’ modern MM force field can 
U=J2 £/ab+ Uabc + Uab 


>e written 
>+ £ 


£ 


(4.18) 
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or written explicitly in terms of the contributions discussed above 


U 


= E \ k ** ( R ™ ~ W + E ^abc (0abc - W) 2 

+ E W C 1 -cos (re (x -Xo)))+ E „ A , (cos f - cosVO 2 

dihedrals 2 ou.-^aee 2 S1 " ^ 


(4.19) 


(i-s) 


1 6a 6b 

47r£ °c^s 


4.7.1 Variations on a Theme 

There are a number of variants of this expression in the literature. Some force fields contain 
mixed terms like 

\(R~ K) (8-0 B ) 

which couple together the bond stretching modes with angle bending. Others use more 
complicated expressions for the individual bending and stretching terms. Some force fields 
allow interactions between lone pairs, which are often referred to as nonatomic interaction 
centres. In addition, there are specialist force fields that are appropriate for restricted ranges 
of compounds such as ions, liquid metals and salts. 

Force fields are determined by one of two routes. First of all, in an ideal world, one might 
calibrate their parameters against accurate quantum mechanical calculations on clusters of 
small molecules. The alternative is to calibrate against experimental data such as crystal 
structure, infrared absorption, X-ray measurements and liquid properties such as density, 
enthalpy of vaporization, Gibbs energies of solvation and the like. To date, almost all 
modem force fields have been obtained by the latter approach. 

The choice of a particular force field for a given application should depend on the type 
of system for which the force field was designed. For example, some force fields have been 
calibrated against the solution properties of amino acids. These are obviously the ones to 
choose when it comes to modelling proteins in solution. 

Finally, I must emphasize the importance of the atom type (i.e. the chemical environment). 
The chemical environment of an atom can be distinguished by 

• its hybridization 

• its formal atomic charge 

• its nearest neighbours. 

For example, one well-known force field distinguishes five types of oxygen atom: 

1. A carbonyl oxygen 

2. A hydroxyl oxygen 

3. A carboxylic or phosphate oxygen 

4. An ester or ether oxygen 

5. An oxygen in water. 

The interactions are calculated according to atom type, not the ‘raw’ elements. 
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4.8 Some Commercial Force Fields 

With these principles in mind, it is time to examine some of the common force fields found 
in professional molecular modelling programs. 

4.8.1 Dreiding 

This force field (Mayo et ctl. 1990) is parameterized for all atom types that any chemist 
would expect for the elements H, C, N, O, P, S, F, Cl, Br and I. In terms of the ‘standard’ 
expression we write 



U 


+ y (1 - cos (n (x - Xo))) + ~ V ^) 2 (4.20) 


out-of-plane 



4.8.2 MM1 

In his review, Allinger (1976) essentially defined what we now call the MM1 force field. He 
treated hydrocarbons only, ignored the Coulomb terms and used an exp-6 Lennard-Jones 
potential. 

Once all the standard cases have been successfully treated, one naturally looks at the 
difficult ones. In the case of hydrocarbons, these difficult cases comprise strained rings 
such as cyclobutane. The problem with cyclobutane is that, whilst having all carbon atoms 
planar can minimize the angular deformation from tetrahedral, the molecule is actually 
puckered by a substantial angle from planarity. In addition, the C-C bond lengths are 
unusually large. The obvious solution is to say that a four-membered ring is different from 
any other hydrocarbon and that the bond angle does not have a natural tetrahedral value, 




Bicyclic and related molecules 


Figure 4.6 Bicyclic and related molecules 
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but one then goes down the undesirable path where all difficult cases have their own set of 
parameters. 

Allinger and others introduced a variety of ‘mixed’ terms into the standard MM potential; 
for example, a bond length-bond angle term and a torsion-bend interaction. 

Figure 4.6 shows typical bicyclic and related hydrocarbons described in Allinger (1976). 


4.8.3 MM2 (Improved Hydrocarbon Force Field) 

Allinger (1977) subsequently introduced MM2. At the time there was a deal of discussion 
in the literature about how different force fields should represent hydrogen atoms: as ‘hard’ 
or ‘soft’ atoms. A hard atom was said to be one whose plot of force versus distance (in the 
diatom) showed a steep slope. A soft atom was one where the slope was gentler. It all boiled 
down to the repulsive part of the nonbonded interactions but eventually Allinger decided 
to retain his exp-6 intermolecular potential. 

MM2 differs from MM1 in three main respects. 

1. The dihedral term was extended to 


U, [/, Ui 

U = — (1 + cos (o) + — (1 + cos 2 co) + — (1 + cos 3 co) 

where each of the t/’s was found by calibration against experiment. 

2. The bending term was extended to 


(4.21) 



(4.22) 


3. All mention of cross terms between bond stretches and bends were finally dropped. 

A great deal of importance was attached to the calculation of enthalpies of formation, and 
42 hydrocarbons were treated. The author claimed that his enthalpy results were comparable 
to experiment in terms of experimental error. 


4.8.4 AMBER 

AMBER (an acronym for assisted model building and energy refinement; Weiner et 
al. 1984) is a force field for the simulation of nucleic acids and proteins. It was 
calibrated against experimental bond lengths and angles obtained from microwave, 
neutron diffraction and accurate quantum chemical studies. The parameters were then 
refined with MM studies on model compounds such as tetrahydrofuran, deoxyaden- 
osine, dimethyl phosphate, 9-methyladenine-l-methylthymine hydrogen bonded complex 
and others. 

The model differs from our standard expression (Equation (4.19)) in four ways. 

1. Hydrogen bonds were included explicitly with a 12-10 potential 
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2. An attempt was made to include solvent effects by inclusion of the Coulomb term with 
a distance-dependent relative permittivity. 

3. The AMBER force field is a ‘united atom’ one, and hydrogen atoms bonded to car¬ 
bons are not explicitly included. They are absorbed into the atom type parameters for 
neighbouring atoms. 

4. Lone pairs were explicitly included for sulfur hydrogen bonding. 

There are a number of different versions of AMBER; the original united atom version 
was later extended to include all atoms. Just to give you a flavour, one modern software 
package has the following choices. 

1. Amber 2 

2. Amber 3 

3. Amber for saccharides 

4. Amber 94 

5. Amber 96. 

4.8.5 OPLS (Optimized Potentials for Liquid Simulations) 

Like AMBER, OPLS is designed for calculations on amino acids and proteins. The easi¬ 
est thing is for me to quote part of the abstract to the keynote paper (Jorgensen and 
Tirado-Rives 1988). 

A complete set of inter molecular potential functions has been developed for use in computer 
simulations of proteins in their native environment. Parameters have been reported for 25 
peptide residues as well as the common neutral and charged terminal groups. The potential 
functions have the simple Coulomb plus Lennard-Jones form and are compatible with the 
widely used models for water, TIP4P, TIP3P and SPC. The parameters were obtained and 
tested primarily in conjunction with Monte Carlo statistical mechanics simulations of 36 pure 
organic liquids and numerous aqueous solutions of organic ions representative of subunits in 
the side chains and backbones of proteins... 

Improvement is apparent over the AMBER united-atom force field which has previously 
been demonstrated to be superior to many alternatives. 

I will explain about TIP and Monte Carlo in later chapters. Each atomic nucleus is an 
interaction site, except that CH„ groups are treated as united atoms centred on the carbon. 
Hydrogen bonds are not given any special treatment, and no special account is taken of 
lone pairs. 

4.8.6 Johnson 

I mentioned earlier the existence of a number of specialist force fields. To take an even 
more extreme example, the Johnson force field (Johnson 1964) is specific to solid-state 
models involving the pure elements Fe, W and V. The pair potential terms are written 

U = di (R — hi) 3 + CiR + d l if £i < R < e 2 

= a 2 (R — bjf + c 2 R + d 2 if e 2 < R < e 3 

= a 3 (R — h 3 ) 3 + c 2 R + d 2 if e 3 < R < £4 

= 0 if £4 < R 


(4.23) 
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where R is the distance between a pair of atoms, e are characteristic distances and a, b, c 
and d are parameters. 

4.8.7 Unified Force Field (UFF) 

I have discussed so far a number of force fields of increasing degrees of special¬ 
ity and application. Let me finish this chapter by drawing your attention to the UFF; in 
the words of the authors Rappe et al. (1992), ‘a Full Periodic Table Force Field for Molecu¬ 
lar Mechanics and Molecular Dynamics Simulations’. Molecular dynamics is the subject 
of a later chapter in this book. I commend this article to you as a model of logic and lucidity 
and it illustrates beautifully all the principles I have discussed so far. Here is the abstract 
(Rappe et al. 1992): 

A new molecular mechanics force field, the universal force field (UFF), is described wherein 
the force field parameters are estimated using general rules based only on the element, its 
hybridization and its connectivity. The force field functional form, parameters, and generating 
formulas for the full periodic table are presented. 

In particular, the parameters used to generate the UFF include a set of hybridization- 
dependent atomic bond radii, a set of hybridization angles, van der Waals parameters, 
torsional and inversion barriers and a set of effective nuclear charges. 

Thus, the force field is represented by Equation (4.20); there is a slight clash of notation, 
for my ‘out-of-plane’ read ‘inversion’. If I focus on the bond stretch, UFF allows both the 
simple harmonic and Morse pair potentials. Both involve an ‘equilibrium’ value for the 
bond length, and this is one of the ‘parameters’ that has to be fixed. The authors took this 
to be the sum of atom type specific single bond data, plus a bond order correction, plus an 
electronegativity correction. 

The authors describe many simple and more difficult organics, together with a good 
number of metal-containing molecules, and conclude that their predicted bond lengths are 
good to 10 pm, and bond angles to about 7°. 
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5 


The Molecular Potential Energy 
Surface 


For one-dimensional problems, we speak about a molecular potential energy curve. 
The simple potential energy curves we have met so far have all shown a single min¬ 
imum. From now on, life gets more complicated (or interesting, depending on your 
viewpoint). 


5.1 Multiple Minima 

The plot in Figure 5.1 shows how the ethane molecular potential varies with dihedral 
angle. The figure shows a full rotation of 360°; all the remaining geometrical variables 
were kept constant. Note that there are three identical minima (and of course three identical 
maxima), and the differences between maxima and minima are all the same. The chemical 
interpretation is that these three minima correspond to conformers where the hydrogens are 
as far apart as possible (i.e. in the tram position). The maxima correspond to conformers 
where the C-H bonds eclipse each other. Multiple minima are common in potential energy 
surface studies, as we will see. 

Consider now the substituted ethane CH 2 C1-CH 2 C1. Aplot of the potential energy versus 
the C1C-CC1 dihedral angle is shown in Figure 5.2. There are three minima, one lower than 
the other two. The three minima are referred to as local minima and the minimum at 180° 
is called the global minimum. The global minimum corresponds to a conformation with 
the two chlorines as far apart as possible. The two other minima correspond to conformers 
where each chlorine is trans to hydrogen. 
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Figure 5. 1 Ethane dihedral motion 
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Figure 5.2 Plot of potential energy versus dihedral angle for CH 2 CI-CH 2 Cl 


5.2 Saddle Points 

One-dimensional potential energy curves show minima, maxima and points of inflexion. 
Two- and higher-dimensional surfaces often show a new feature called a saddle point. 
Saddle points are a maximum in some variables and a minimum in the remainder. The one 
type of saddle point that interests chemists is where we have a minimum in all variables but 
one, and a maximum in this remaining one. This corresponds to a transition state in theories 
of reaction mechanisms. I just mention it here, despite the fact that molecular mechanics 
cannot properly treat bond breaking or bond making. I am not going to deal in detail with 
saddle points until I have given you a firm grounding in quantum chemistry, but you should 
be aware of their existence. 
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There are also points where the molecular potential energy is a maximum. We are rarely 
interested in these. 

Figure 5.3 shows a saddle point (for the function f(x,y) = x 2 — y 2 ). The vertical axis 
shows the function values, and the two horizontal axes are x and y. The apocryphal reason 
why such points are called saddle points is that the figure above resembles a horse’s saddle. 
Molecular potential energy surfaces generally contain all possible features and you might 
like to consider the example shown in Figure 5.4, which shows two (equal) minima and a 
saddle point in between. 



Figure 5.3 Saddle point 



Figure 5.4 Two minima and a saddle point 
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In this chapter I want to show you how we locate stationary points on the molecular 
potential energy surface, and how we characterize them. I will start with the second problem, 
because it is straightforward. 


5.3 Characterization 

For a linear molecule with N atoms, there are in total 3/V degrees of freedom. Three of 
these are translational degrees of freedom, three are rotational (two for a linear molecule) 
and the remaining p = 3N — 6 (3/V — 5 if linear) are vibrational degrees of freedom. We 
therefore need p independent variables to describe them, and these are known as the internal 
coordinates. It is traditional to denote such variables as q\ if I write them q u q 2 ,..., q p 
then the molecular potential energy will depend on these q’s and the normal coordinates 
will be linear combinations of them. For the minute, do not worry about the nature of these 
variables. In Chapter 4,1 showed how to collect such variables into the column vector q 
(a matrix of dimension p x 1): 


q = 


and I also defined the gradient of U, g = g 
a symmetric square matrix: 

dU \ 
dqi 
3 U 

g = dqi , H= 

3 U 

K dq p ) 

The molecular potential energy depenc 
stationary point, the gradient is zero: it is as easy as that. In order to characterize the 
stationary point, we have to find the eigenvalues of the Hessian calculated at that point. If 
the eigenvalues are all positive, the point is a minimum. If the eigenvalues are all negative, 
the point is a maximum. Otherwise the point is a saddle point. Saddle points of interest 
to chemists are those that are a minimum in all degrees of freedom except one, and it 
transpires that the Hessian has just one negative eigenvalues at such a point. They are 
sometimes called first-order saddle points. 


f qi N 
<12 

V 9p / 

•ad U, as a column vector, and its Hessian H as 


d 2 U 

3 2 U 

dqj 

dqidq p 

3 2 U 

3 2 U 

dq p dqi 

9q 2 


on the q’s, and I will write it as £/(q). At 


5.4 Finding Minima 

MM tends to be concerned with searching the molecular potential energy surfaces of very 
large molecules for minima (equilibrium structures). Many mathematicians, scientists and 
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engineers are concerned with related problems. The field of study is now an established part 
of applied mathematics, and is referred to as optimization theory. An optimization problem 
involves minimizing a function of several variables, possibly subject to restrictions on the 
values of the variables. There is no single universal algorithm for optimization, and it is 
necessary for us to spend a little time dealing with the different methods of interest. 

I should tell you that one speaks of optimization and less commonly of minimization or 
maximization; the latter two are equivalent, for maximization of function/ is the same as 
minimization of the function —/. 

The first MM optimization seems to have been carried out by Westheimer (1956), who 
did the calculations ‘by hand’. The first computer calculations seem to have been done by 
Hendrickson (1961). Neither of their methods was generally applicable to any molecule. 

Many algorithms have been developed over a number of years for the location of station¬ 
ary points, some of which are suitable for MM calculations, some of which are suitable for 
quantum mechanical calculations and many of which are not particularly suited to either. 
In MM we tend to deal with large molecules and consequently the molecular potential 
energy function will depend on hundreds if not thousands of variables. On the other hand, 
evaluation of the potential energy at each point on the hypersurface is simple. 

Different considerations apply to quantum chemical calculations, to be dealt with later in 
the text. Calculation of the energy at points on the surface is far from easy but the number 
of variables tends to be smaller. 

As stated above, transition states are special cases of saddle points; they are stationary 
points on the surface where the Hessian has just one negative eigenvalue. Transition state 
searching is a hot topic in chemistry, and a number of specialist algorithms have been 
proposed. I will deal with transition states in Chapter 22. 

5,5 Multivariate Grid Search 

This is the oldest method for finding minima, and it has a long history. What we do is the 
following. 

1. Choose a suitable grid for the variables. 

2. Choose a starting point A on the grid. 

3. For each variable q u q 2 ,..., q p evaluate the molecular potential energy U at the two 
points surrounding A (as determined by the grid size). 

4. Select the new point for which U is a minimum, and repeat steps 3 and 4 until the local 
minimum is identified. 

The method is intuitive, and it is apparent that a local minimum will eventually be found. 

5.5.1 Univariate Search 

This is sometimes called the cyclic search, for we perform successive one-dimensional 
searches for each of the variables in turn. 

1. Choose a starting point A. 

2. Minimize U(q) for each variable q u q 2 ,...,q p in turn. 

3. Repeat the cycle as necessary. 



70 Molecular Modelling for Beginners 


The second step is the crucial one, and a favourite strategy is quadratic interpolation. 
Suppose that X| , x 2 and x 3 are three successive values of one of the variables q, (say), chosen 
such that they bracket a minimum in that particular variable, as shown in Figure 5.5. We fit 
the three points to a quadratic, and a little algebra shows that the minimum a is given by 

1 U fo) (*3 - * 2 2 ) + U fe) (*? ~ xp + U (x 3 ) (x\ - x?) 

01 2 U (x,) (x 3 -x 2 ) + U (x 2 ) (Xi -x 3 ) + U (x 3 ) (x 2 - x,) 

The method is also known as a line search. 



Figure 5.5 Univariate minimization 


5.6 Derivative Methods 

It is conventional to divide optimization algorithms into those methods that make use of 
derivatives (gradients and Hessians) and those that do not. Neither of the two methods 
discussed so far make use of derivatives. Some authors use a further subdivision into first- 
order derivative methods (where we make use of the gradient) and second-order derivative 
methods (where we use both the gradient and the Hessian). 

There are many algorithms in the literature; I can only give you a flavour. 


5.7 First-Order Methods 
5.7.1 Steepest Descent 

This first-order derivative scheme for locating minima on molecular potential energy 
surfaces was put forward by Wiberg (1965). His basic algorithm can be summarized as 
follows. 

1. Calculate U for the initial structure. 

2. Calculate U for structures where each atom is moved along the x, y and z axes by a 
small increment. Movement of some atoms will lead to a small change in U, whilst 
movements of other atoms will lead to a large change in U. (The important quantity is 
clearly the gradient.) 
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3. Move the atoms to new positions such that the energy U decreases by the maximum 
possible amount. 

4. Repeat the relevant steps above until a local minimum is found. 

Wiberg’s method would today be classified as ‘steepest descent’; the idea is that we start 
from point A on a molecular potential energy surface and identify the fastest way down 
the surface to the local minimum. Figure 5.6 shows a two-dimensional potential energy 
surface. The negative of grad U at point A gives the maximum rate of decrease of U, and 
also the direction in which to proceed (the gradient vector is perpendicular to the contour 
at A). The only problem is to choose a step length; this can be done by a line search along the 
vector indicated, and it is clear that U will pass through a linear minimum around point B. 



We repeat the process from point B until the local minimum is (eventually) reached. It can 
be proved that every step is necessarily at right angles to the one before it, and this leads to 
one of the infuriating characteristics of the steepest descent method: it takes very many tiny 
steps when proceeding down a long narrow valley, as it is forced to make a right-angled 
turn at every point, even though that might not be the best route to the minimum. 

At this point, I should introduce some notation that is universally understood by workers 
in the field of optimization. Like it or not, you will come across it when you read the 
technical literature. 

All of the algorithms that I will describe are iterative ; that is, we start from some initial 
point A on a surface, and then move on in cycles, according to an algorithm, hopefully 
toward a stationary point. Each cycle of the calculation is called an iteration, and people 
often use the symbol k to keep track of the iteration count. I am going to follow common 
practice and add a superscript ‘(k)’ to variables in order to indicate this iteration count. 
Sometimes I will just write it as ‘k\ when there is no possibility that you will regard the 
iteration count as meaning ‘to the power of’. 

5.7.2 Conjugate Gradients 

Starting from point q® (where k is the iteration count), we move in a direction given by 
the vector 


y® = - g ® + 1 


(5.2) 
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where g® is the gradient vector at point q® and y® is a scalar given by 


_ (g®) T g® 

(g(*-l)) T g ( C-l) 


(5.3) 


I have used the superscript T to denote the transpose of a matrix. The line search method 
is then used to determine the distance to be moved along this vector. This algorithm was 
first proposed by Fletcher and Reeves (1964). 

Conjugate gradients methods produce a set of directions that overcome the oscillatory 
behaviour of steepest descents in narrow valleys. Successive directions are not at right 
angles to each other. Such methods are also referred to as conjugate direction methods. 

Polak and Ribiere (1969) proposed an alternative form for the scalar y ®: 


|g» CT .g»- 1) ) T j |# 

( g (*- 1) ) T g(*-D 


(5.4) 


For a purely quadratic function, their method is identical to the original Fletcher-Reeves 
algorithm. The authors pointed out that most U ’s encountered in chemistry are only 
approximately quadratic, and that their method was therefore superior. 


5.8 Second-Order Methods 
5.8.1 Newton-Raphson 

Second-order methods use not only the gradient but also the Hessian to locate a min¬ 
imum. Before launching into any detail about second-order methods, it is worth spending 
a little time discussing the well-known Newton-Raphson method for finding the roots of 
an equation of a single variable. This is a simple illustration of a second-order method. For 
illustration, consider the function 

f (x) = x exp (-x 2 ) 

shown as the full curve in Figure 5.7. It is obvious by inspection that our chosen function 
has roots at x = 0 and at +oo and — oo, but suppose for the sake of argument that we do 
not know that the function had a root at x = 0. We might make a guess that the function 
has a root at (say) x = 0.4. This is our first guess, so I set the iteration count = 1 and write 
the guess x (,) (also, the function has a value of/ (1) = 0.3409). Thus we have 

x (1) = 0.4, f m = 0.3409 
The function can be differentiated to give 

^ = (l-2r) exp (—x 2 ) 

and the gradient g (1) at x (1) = 0.4 has a value 0.5795. The tangent line at x (1) = 0.4 is 
therefore 


y = 0.5795x +0.1091 
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which I have shown as a full straight line on Figure 5.7. A little algebra shows that this line 
crosses the x axis at 



This crosses the x axis at —0.1883 and this value is taken as the next best estimate x <2> of 
the root. I then draw the tangent line atx <2) , which is shown dotted in Figure 5.7, and so on. 
Table 5.1 summarizes the iterations. 



Table 5.1 Newton-Raphson iterations 


k 


fOt) 

g ( « 

1 

0.4 

0.3409 

0.5795 

2 

-0.1882 

-0.1817 

0.8968 

3 

1.4357x10 -2 

1.4354x10- 2 

0.9994 

4 

—5.9204x10 -6 

—5.9204x10 -6 

1.0000 

5 

0.0000 

0.0000 

1.0000 


There are a few points worth noticing. First, the convergence is rapid. In fact, for a 
quadratic function the Newton-Raphson method will locate the nearest root in just one step. 

Second, the choice of starting point is crucial. If I start with x = ±0.5, the successive 
estimates simply oscillate between ±0.5 and —0.5. If I start with |x] < 0.5, then the method 
converges to the root x = 0. If I start with |x| > 0.5 then we find the infinite roots. 

The method can be easily modified to search for stationary points, which are characterized 
by a zero gradient. The iteration formula 




(5.5) 
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becomes 

= x® - g {k> /H (k) (5.6) 

where H (k) is the value of the second derivative at the current point x®. 

For this particular function, we have 

H (x) = (4x 3 - 6x) exp (-x 2 ) 

and I can now repeat the argument given above, but this time searching for stationary points 
rather than roots of the equation. 

Figure 5.8 shows the gradient (the full curve), and the first two iterations of the Newton- 
Raphson algorithm for the stationary point, starting at x (1) = 0.2. The gradient is zero at 
infinity and atx = ±0.7071. 



In the case of a function of many variables x u x 2 ,..., x„, the Newton-Raphson 
optimization algorithm can be written 

X (*+i) = x m _ ( H ®) _1 g® (5.7) 

where I have collected the x’s into a column vector X and so on. The formula can be easily 
derived from the Taylor expansion. 

We therefore have to calculate the gradient and the Hessian at each successive point in 
the iterations. Normally, the algorithm needs an infinite number of steps but it will find the 
minimum of any quadratic function in just one step, starting at an arbitrary point on the 
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surface. For example, if we consider the function/(x, y) = x 2 +y 2 which has a minimum 
at x = 0, y = 0 we have 



If we start from the point (2, 2) we calculate 



which is a null vector. 

In practice molecular potential energy surfaces are rarely quadratic and so an infinite 
number of steps will be required. We simply stop the calculation once a given number of 
decimal places have been achieved. 

5.8.2 Block Diagonal Newton-Raphson 

There are a number of variations on the Newton-Raphson method, many of which aim 
to eliminate the need to calculate the full Hessian. A widely used algorithm is the block 
diagonal Newton-Raphson method, where just one atom is moved at each iteration. This 
means that all the elements of the Hessian are zero except for a block along the diagonal 
describing the atom in question. 

5.8.3 Quasi-Newton-Raphson 

In addition, there are a number of so-called quasi-Newton-Raphson methods that gradually 
build up the inverse Hessian in successive iterations. At each iteration, the vector X® is 
updated to X (i+1) 

X<*+1) = x® - (H®)~‘ g® 

using the gradient and the current estimate of the inverse Hessian. Having made the move to 
the new position, H 1 is updated from its value at the previous step by an approximate pro¬ 
cedure depending on the algorithm employed. The methods of Davidon-Fletcher-Powell, 
Broyden-Fletcher-Goldfarb-Shanno and Murtaugh-Sargent are often encountered, but 
there are many others. 

5.8.4 Fletcher-Powell Algorithm 

The Fletcher-Powell algorithm (Fletcher and Powell 1963) is a derivative method where 
elements of the gradient and the Hessian are estimated numerically each cycle. Essentially, 
it assumes that the surface is quadratic around each chosen point, and finds the minimum 
for the approximate quadratic surface. Steps in the algorithm are as follows. 
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1. Calculate the energy U m for an initial geometry X (1) , and at positive and negative 
displacements for each of the coordinates. 

2. Fit a quadratic for each of the coordinates according to the formula 

U(X) = U k + J2 (*? X - x f) + \ H U (X - X) 2 ) ( 5 . 8 ) 

(this essentially gives numerical estimates of the gradient and the Hessian; I have dropped 
the brackets round the iteration count for clarity). 

3. Find a minimum of this expression; we have 


g- + H k u (Xi - X k ) = 0 (5.9) 


cf = X, -Xf = 


g[ 

Hu 


The last term gives the correction to coordinate x,; if these are small enough then stop. 

4. Calculate the energy at points X*, X* + c k and X* + 2c*. 

5. Fit a quadratic to these three points as above. 

6. Find the energy minimum, as above. This gives point X* +1 on the surface. 

7. Calculate the gradient g* +1 at this point, increase the iteration count and go back to step 3. 


5.9 Choice of Method 

The choice of algorithm is dictated by a number of factors, including the storage and 
computing requirements, the relative speeds at which the various parts of the calculation 
can be performed and the availability of an analytical gradient and Hessian. Analytic first 
and second derivatives are easily evaluated for MM force fields; the only problem might be 
the physical size of the Hessian. For this reason, MM calculations on large systems are often 
performed using steepest descent and conjugate gradients. The Newton-Raphson method 
is popular for smaller systems, although the method can have problems with structures that 
are far from a minimum. For this reason, it is usual to perform a few iterations using (for 
example) steepest descent before switching to Newton-Raphson. The terms ‘large’ and 
‘small’ when applied to the size of a molecule are of course completely relative and are 
dependent on the computer power available when you do the calculation. 

In cases where the differentiation cannot be done analytically, it is always possible to 
estimate a gradient numerically; for example in the case of a function of one variable 
/ 4 A ^ /fa + A)-/fo) 

\dv ) X=X1 ~ A 

where A is small. Algorithms that rely on numerical estimates of the derivatives need more 
function evaluations than would otherwise be the case, so there is a delicate trade-off in 
computer time. It is generally supposed that gradient methods are superior to nongradient 
methods, and it is also generally thought to be advantageous to have an analytical expression 
for the gradient and Hessian. 
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5.10 The Z-Matrix 

For a nonlinear molecule of N atoms, there are p = 3N — 6 vibrational degrees of freedom 
that should be described by p independent coordinates q u q 2 ,..., q p and at first sight 
all should be varied in a geometry optimization. These coordinates are often defined to 
modelling packages using the so-called Z-matrix, which gives a way of building up a 
molecular geometry in terms of valence descriptors such as bond lengths, bond angles and 
dihedral angles. Two examples will help to make things clear. First of all water, for which I 
will take a bond length of 95.6 pm (0.956 A) and an H-O-H bond angle of 104° (Figure 5.9). 


Z-Matrix 

Oxy 

HI Oxy ROH 
H2 Oxy ROH HI Ang 

ROH 0.956 
Ang 104. 


Figure 5.9 Z-matrix for water 


I can start my Z-matrix at any atom, and that atom occupies a line by itself. I choose 
oxygen, which I have called Oxy. HI is joined to oxygen with a bond length of 
ROH (0.956 A). I have assumed that the bond lengths are equal. H2 is also joined to 
oxygen with a bond length of 0.956 A, and the H2-oxygen-Hl bond angle is Ang (104°). 

Notice that I have subconsciously added a symmetry constraint by requiring that the two 
bond lengths are equal. This means that I am effectively reducing the number of degrees 
of freedom, for the purposes of geometry optimization. We will see later that this is not 
necessary. 

I had no particular software in mind when I wrote this example (although it would be 
fine for Gaussian 03, as we will see in later chapters). There are many Z-matrix pits for the 
unwary to fall into, some simple and some subtle. Two simple ones: I did not call the first 
atom ‘Oxygen’ because many packages limit you to a small number of characters for atomic 
descriptors; also, I was careful to include a decimal point for the starting values of ROH 
and Ang, since many packages still expect floating-point numbers rather than integers. 

For molecules having more than three atoms, we also have to define the dihedral angles 
in addition to bond lengths and bond angles. Figure 5.10 shows a Z-matrix for ethene, where 
I have made a lot of assumptions about the molecular geometry. All the C-H bond lengths 
are equal, and all the HCC bond angles are also taken to be equal. I have also assumed that 
the molecule is planar (as indeed it is, in its electronic ground state). 

I should admit that Z-matrices are not easy things to write for large molecules, and in 
any case downloads of molecular geometries (for example, from the Protein Data Bank) 
are usually in Cartesian coordinates. There is also a problem to be addressed, namely that 
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Z-Matrix 


Cl 

C2 Cl RCC 
HI Cl RCH 
H2 C2 RCH 
H3 C2 RCH 
H4 Cl RCH 


C2 AngleHCC 
Cl AngleHCC HI 180. 
Cl AngleHCC HI 0. 
C2 AngleHCC H3 180. 


RCC 1.34 
RCH 1.08 
AngleHCC 120. 


Figure 5.10 Ethene Z-matrix 


there are at most 3N — 6 vibrational degrees of freedom for a nonlinear molecule that is 
described by 3N Cartesian (or other) coordinates. The Z-matrix has to be correctly written 
in order to describe these 3N — 6 internal coordinates, and a frequent cause of problems in 
optimizations specified by a Z-matrix was (and still is) the specification of too many or too 
few internal coordinates. Too few coordinates correspond to a constraint on the geometry 
and means that the full surface is not searched. Too many coordinates result in a redundancy 
leading to zero eigenvalues in the Hessian (which cannot then be inverted). 

In the early days of geometry optimizations it was usual to take all kinds of shortcuts in 
order to save on computer resources. For example, we would routinely take all the C-H 
bond lengths equal in substituted benzenes, yet there are subtle, systematic differences and 
these should come out of a respectable calculation. Computer resources are much cheaper 
than was once the case, and the assumptions of constant bond length, etc. are no longer 
needed. 

The Z-matrix method is often claimed to be intuitive to chemists because it uses the 
everyday concepts of bond lengths, bond angles and so on. It was always an acquired 
taste and in any case many databases give molecular geometries in terms of Cartesian 
coordinates, not internal ones. 


5.11 The End of the Z-Matrix 

Geometry optimization is of major importance in modem molecular modelling. Most of the 
early packages used internal coordinates as input by the Z-matrix. Practically all modem 
(gradient) optimization procedures require calculation of the Hessian H and/or its inverse. 
In practice, it is usual to make an estimate and update these estimates at every iteration. 
Sometimes, the initial Hessian is taken to be the unit matrix, sometimes not. A great strength 
of the internal coordinate method is that construction of the initial Hessian can be based on 
chemical ideas; the individual diagonal elements of H are identified as bond stretching and 
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bond bending force constants, etc. Also, the redundant translational and rotational degrees 
of freedom have already been eliminated. 

Baker and Hehre (1991) investigated the possibility of performing gradient geometry 
optimization directly in terms of Cartesian coordinates. Their key finding concerned the 
identification of a suitable initial Hessian, together with a strategy for updates. They reported 
on a test set of 20 molecules, and argued that optimization in Cartesian coordinates can be 
just as efficient as optimization in internal coordinates, provided due care is taken of the 
redundancies. 

There are actually two mathematical problems here, both to do with the Newton-Raphson 
formula. Modem optimizations use gradient techniques, and the formula 

x (t+ i> = X<*> - (H<T V> 

demonstrates that we need to know the gradient vector and the Hessian in order to progress 
iterations. In the discussion above, I was careful to stress the use of p independent variables 

qu # 2 , • • •, q P - 

As stressed many times, there are p = 3/V — 6 independent internal coordinates for a 
nonlinear molecule but 3/V Cartesian coordinates. The familiar Wilson B matrix relates these 


q = BX (5.10) 

B has 3 N rows and p columns, and the rows of B are linearly dependent. The molecular 
potential energy depends on the q’ s, and also on the A’s but we need to be careful to dis¬ 
tinguish between dependent and independent variables. We can certainly write gradient 
vectors in terms of the two sets of variables: 




( 


(grad U) c = 


dU 

dU 

dX 2 


\ 


dU 

\ 9 q P ) 


dU 

\9X^J 


where I have added a subscript ‘C’ to show that the differentiations are with respect to the 
3 N Cartesian coordinates, but it is grad U that appears in optimization expressions and not 
(grad U) c - We therefore have to relate the two. 

Starting from q = BX we can show that the gradients are related by 


B T grad U = (grad U) c 


Remember that we want an expression for grad U; if B were square and invertible we 
would just write 

grad U = (B t ) _1 (grad U) c 

but unfortunately B is rectangular. We therefore appeal to the mathematical concept of a 
generalized inverse. Consider the 3 N x 3 N matrix G = BuB T , where u is an arbitrary 
nonsingular p x p matrix. The generalized inverse of G, written G -1 , is a matrix with the 
property that GG -1 is a diagonal matrix with a certain number of l’s and a certain number 
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of 0’s along the diagonal. In our case, GG -1 will have p 1 ’s along the diagonal and 3N — p 
zeros. There are many algorithms for calculating such a generalized inverse. The zeros might 
actually be tiny numbers, depending on the accuracy of the numerical algorithm used. 

It can be shown that the gradient vector referred to the independent variables is related 
to the Coordinate gradient vector by 

grad U = G -1 Bu (grad U) c (5.11) 

Similar considerations apply to the Hessian, and formulae are available in the literature. 


5.12 Redundant Internal Coordinates 

All coordinate systems are equal in principle, but I have stressed above that: 

• they should ideally give transferable force constants from molecule to molecule; 

• they should ideally be capable of being represented as harmonic terms in U, i.e. cubic 
and higher corrections should not be needed. 

These requirements can be best satisfied by local internal valence coordinates such as bond 
lengths, bond angles and dihedral angles. The expression ‘local’ in this context means that 
the coordinates should extend to only a few atoms. 

Pulay et al. (1979) and Schlegel and co-workers (see Peng et al. 1994, 1996) both 
investigated the use of redundant internal coordinates for gradient optimizations. Pulay 
et al. defined an internal coordinate system similar to that used by vibrational spectroscop- 
ists. It minimizes the number of redundancies by using local symmetry coordinates about 
each atom and special coordinates for ring deformations, ring fusions, etc. Schlegel and 
co-workers used a simpler set of internal coordinates composed of all bond lengths, valence 
angles and dihedral angles. 

The mathematical considerations outlined above also apply here. 

Packages like Gaussian 03 offer a choice between Z-matrix and Cartesian coordinates 
for input of the molecular geometry. Gaussian 03 then routinely uses redundant internal 
coordinates for the optimizations rather than the Z-matrix. 
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6 

Molecular Mechanics Examples 


As a rough guide, molecular mechanics (MM) is used these days to study possible 
equilibrium conformations of large molecules. In this chapter I will give an example of 
what can be routinely done at this level of theory. 

There are many packages on the market, and everyone has their favourite (or has 
it imposed on them by their employer). My two favourite packages are Gaussian 03 
(http://www.gaussian.com) and HyperChem v7 (http://www.hyper.com). The latest release 
of HyperChem is actually v8. In my opinion, everything boils down to the graphical user 
interface (GUI) for MM applications on large molecules and HyperChem has a first-class 
integrated GUI. You do not need a third party package to input and visualize the calculations. 

So for this chapter I am going to use HyperChem and treat the very simple molecule 
phenylanine (Figure 6.1) discussed earlier in the book. 

0 


nh 2 



Phenylanine 

Figure 6.1 Structure of phenylanine 


6.1 Geometry Optimization 

Geometry optimization requires a starting structure. I mentioned the Z-matrix in Chapter 5, 
and that is how molecules were input before the days of the GUI. We have moved along 
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since then, and to get started I simply built the molecule onscreen in HyperChem using the 
‘build’ option (Figure 6.2). 



Figure 6.2 HyperChem screen grab 

You probably noticed the single chiral centre; most packages have an automatic inversion 
feature in case you want to switch between D and L forms. 

Most packages also give the so-called ‘rendering’ as an option. I chose ‘balls and cyl¬ 
inders’ but you can experiment to get whatever pleases you best. There is a choice of 
optimization method and a choice of force field. AMBER seemed the appropriate choice 
for an amino acid, and I easily located a stationary point with energy 8.678411 kcalmol -1 . 

This energy is not for direct comparison with experiment; only differences are of any 
significance in MM calculations. In this example, I am going to use the energy as a relative 
marker in our search for the global minimum. 


6.2 Conformation Searches 

The stationary point located above is of course a local minimum. Figure 6.3 shows the 
results we might expect to find from a conformational search on a (hypothetical) molecule. 
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Repeating the optimization from different starting points has identified many local minima 
and the conformation corresponding to energy e, appears to be the global minimum. 


e i 


k B T 


Figure 6.3 Conformation search 


Our MM calculations refer to the molecule at 0 K, whilst we would normally concern 
ourselves with molecules at room temperature. At 0 K, we would find all molecules in 
the lowest energy conformation but at other temperatures we will find certain fractions 
of the molecules in higher energy conformations. These fractions are determined by the 
Boltzmann factor 



where k B is the Boltzmann constant. I have therefore drawn a vertical line on the figure 
of size k B T, and we see that the first four conformations lie close together (compared to 
the yardstick k B T), and so we expect that all four would make a significant contribution to 
measured properties of the molecule at temperature T. 

The remaining groups of conformations are far away from the first four in terms of the 
yardstick k B T, and so can be discarded. 

Several approaches have been applied to the problem of automatically identifying the 
low-energy conformations of a given molecule. Molecular flexibility is usually associated 
with rotation of unhindered bond dihedral angles, with little change in bond lengths and 
bond angles. A common theme is therefore to limit the exploration to those parameters that 
have the least effect on the energy. Another strategy is to automatically scan a possible 
structure for unfavourably close nonbonded atoms and unlikely ring closure bond lengths 
before optimization. 

A choice then has to be made to decide whether the starting structure should be the same 
for every optimization, or each optimization should start with the structure found by the 
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previous successful optimization. The latter choice is often favoured, on the grounds that 
one low-energy conformation is pretty much like another, and starting from one successful 
geometry will tend to keep the search in a low-energy region of the potential surface. 

Variations in the parameters can be either random or systematic; in the case of systematic 
variations, it is usual to scan the surface first at Tow resolution’, then to refine likely looking 
regions at high resolution. In the case of random variations, a common problem is that as the 
calculation progresses, more and more of the trial structures lead to conformations that have 
already been examined. Many modern software packages have sophisticated algorithms that 
can recognize identical conformations, and so reject them from the global search. They also 
automatically reject conformations that are too high in energy, compared to k B T. 

Figure 6.4 shows a run using the ‘Conformational Search’ option of HyperChem. I had 
to decide which torsional (dihedral) angles to vary, and I chose just two for this illustrative 
calculation: these are called ‘4-7-8-10’ and ‘5-4-7-8’, with the atom numbering shown. 



The ‘Found’ table entry shows the number of times that each particular conformation has 
been found, and ‘Used’ shows the number of times it has been used as a starting structure. 
Energies are recorded as kcal mol -1 . 


6.3 Amino Acids 

Whilst on the subject of amino acids, Hyperchem has a database of the amino acid residues 
that can be quickly joined together to construct a chain. The p K of most amino acids is 
such that the molecules exist in zwitterion form in aqueous solution, and there is a ‘Make 
Zwitterion’ option. It can be argued that one should include any ‘formal’ atom charges such 
as the zwitterion charges explicitly, and in the case of phenylanine I would modify the 
formal charge on each oxygen atom to be — 1 / 2 electron and add a charge of +1 electron on 
the nitrogen. 
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6.3.1 Molecular Structure Databases 

Molecular geometries can be determined for gas-phase molecules by microwave spectro¬ 
scopy and by electron diffraction. In the solid state, the field of structure determination is 
dominated by X-ray and neutron diffraction and very many crystal structures are known. 
Nuclear magnetic resonance also has a role to play, especially for proteins. All of these 
topics are well discussed in every university-level general chemistry text. 

Over the years, a vast number of molecular structures have been determined and there 
are several well-known structural databases. One is the Cambridge Structural Database 
(CSD), which is supported by the Cambridge Crystallographic Data Centre (CCDC). The 
CCDC was established in 1965 to undertake the compilation of a computerized database 
containing comprehensive data for organic and metal-organic compounds studied by X-ray 
and neutron diffraction. It was originally funded as part of the UK contribution to interna¬ 
tional data compilation. According to its mission statement, the CCDC serves the scientific 
community through the acquisition, evaluation, dissemination and use of the world’s output 
of small-molecule crystal structures. 

For each entry in the CSD, three types of information are stored. First is the bibliographic 
information: who reported the crystal structure, where they reported it, and so on. Next 
comes the connectivity data; this is a list showing which atom is bonded to which in the 
molecule. Finally the molecular geometry and the crystal structure are given. The molecular 
geometry consists of Cartesian coordinates. The database can be easily reached through 
the Internet, but individual records can only be accessed on a fee-paying basis. 

The Brookhaven Protein Data Bank (PDB) is the single worldwide repository for the 
processing and distribution of three-dimensional biological macromolecular structural data. 
It is operated by the Research Collaboratory for Structural Bioinformatics. At the time of 
writing, there were 49 048 structures in the databank, relating to proteins, nucleic acids, 
protein-nucleic acid complexes and viruses. The databank is available free of charge. 

Information can be retrieved from the main website. A four-character alphanumeric 
identifier such as 1PCN represents each structure. The PDB database can be searched using 
a number of techniques, all of which are described in detail at the homepage. 


6.3.2 The .pdb File Format 

The PDB file format (.pdb) is widely used to report and distribute molecular structure data. 
Atypical .pdb file for phenylanine would start with bibliographic data, then move on to the 
Cartesian coordinates (expressed in angstroms and relative to an arbitrary reference frame) 
and connectivity data as shown below. 


HETATM 1 C 1 

HETATM 2 C 2 

HETATM 3 C 3 

HETATM 4 C 4 

HETATM 5 C 5 

HETATM 6 C 6 

HETATM 7 C 7 

HETATM 8 C 8 


-1.576 -0.433 0.004 

-1.301 0.777 0.643 

-0.072 1.410 0.444 

0.898 0.848 -0.394 

0.609 -0.365 -1.029 
-0.617 -1.004 -0.834 
2.227 1.537 -0.618 

3.352 0.904 0.217 
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HETATM 9 C 9 
HETATM 10 N 10 

etc.etc. 

CONECT 1 2 6 13 
CONECT 2 1 3 14 

CONECT 3 2 4 15 
CONECT 4357 


4.694 1.557 -0.049 
3.025 1.002 1.616 


etc. etc. 


END 

Records 1 through 23 identify the atoms; so, for example, atom 1 is a carbon. The ‘CONECT 
12 6 13’ record tells us that atom 1 (C) is joined to atoms 2, 6 and 13. 


6.4 QSAR 

Quantitative structure-activity relationships (QSARs) are attempts to correlate molecular 
structure with chemical or biochemical activity. I will use the remainder of this chapter 
to give you a flavour for the QSAR properties that are often routinely calculated in MM 
studies. 


6.4.1 Atomic Partial Charges 

Atomic charges can be calculated using the (costly) techniques of molecular quantum 
mechanics, as we will discover in later chapters. They are often calculated within the spirit of 
QSAR analyses using the partial equalization of orbital electronegativity (PEOE) approach 
of Gasteiger and Marsili (1980). The key concept of their method is atom electronegativity 
(often given the symbol x X and an attraction is that their calculation is rapid. Pauling and 
Yost (1932) first introduced this concept, and they tried to give numerical values to atomic 
electronegativities based on bond energy values. Mulliken (1934) put the concept on a 
firmer theoretical footing by relating an atomic electronegativity to the ionization energy I 
and the electron affinity E of the atom concerned: 

* = -(/ + £) ( 6 . 1 ) 

The only problem with Mulliken’s definition is that both E and I relate to some mythical state 
of an atom called the valence state. There have been many learned discussions relating to 
the concept of valence state, and such discussions still occasionally appear in the literature. 

Whatever the valence state of an atom is (depending on the atom), it rarely corresponds 
to a spectroscopic state and so cannot be studied experimentally. Electronegativities and 
valence states go hand in hand; they are part of our folklore and both are widely quoted in 
even the most elementary chemistry texts. 

Sanderson (1951) proposed that, on bond formation, atoms change their electron density 
until their electronegativities are equal. These ideas were further elaborated by Hinze and 
co-workers (1963), who introduced the concept of orbital electronegativity. This is the 
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electronegativity of a specific orbital in a given valence state. You will understand that 
I am introducing quantum mechanical concepts before we have finished our discussion of 
techniques based on classical mechanics; just bear with me. 

In the abstract of their paper, Gasteiger and Marsili (1980) state: 

A method is presented for the rapid calculation of atomic charges in o -bonded and nonconjug- 
ated 7i systems. Atoms are characterized by their orbital electronegativities. In the calculation, 
only the connectivities of the atoms are considered. Thus only the topology of a molecule is 
of importance. Through an iterative procedure partial equalization of orbital electronegativity 
is obtained. Excellent correlations of the atomic charges with core electron binding energies 
and with acidity constants are observed. 

These authors decided to relate the electronegativity of orbital v on atom i (x,>) to the 
total charge <2, on atom i as a quadratic 

Xi,v = ot, v + hMi + c iM 2 i ( 6 . 2 ) 
The three unknowns a, b and c were determined from values used at the time for orbital 
ionization energies and affinities. Typical values are listed in Table 6.1. 


Table 6.1 Parameters needed to calculate electronegativity 


Atom 

Hybridization 

a 

b 

c 

H 


7.17 

6.24 

-0.56 

C 

sp 3 

7.98 

9.18 

1.88 


sp 2 

8.79 

9.32 

1.51 


sp 1 

10.39 

9.45 

0.73 


The calculation is iterative; starting from a reference structure where all the atoms carry 
zero charge, electric charge is permitted to flow from the less to the more electronegat¬ 
ive atoms. The electronegativity of an atom decreases as it accumulates electric charge, 
according to the formula, and in the next iteration less charge will flow until eventually the 
electronegativity of each (charged) atom is equalized. At this point the flow of charge stops. 

Molecular mechanics is of course a classical treatment, and orbitals do not appear. The 
key equation is therefore written in terms of the atoms overall rather than in terms of orbital 
contributions: 

Xi = a; + biQi + CiQf (6.3) 

Atom electronegativities are adjusted at each iteration using the calculated charges, until 
self-consistency is achieved. Figure 6.5 shows typical results for phenylanine. Notice the 
negative charges on oxygen and the positive charge on nitrogen. Charges calculated in this 
way depend only on the atomic connectivity and not on the molecular geometry. 

6.4.2 Polarizabilities 

I introduced you to polarizabilities in Chapter 2, when we discussed the theory of inter- 
molecular forces. The experimental determination of a molecular polarizability is far 
from straightforward, especially if the molecule has little or no symmetry. The classical 
experimental route to the mean polarizability <a> is via the refractive index or relative 
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permittivity of a gas. Polarizabilities can be routinely calculated by quantum mechanical 
techniques, but the calculations are compute-intensive and this route is of no help if your aim 
in life is the high-throughput screening of very many potential pharmaceutical molecules. 

An insight into the order of magnitude of polarizabilities can be obtained by considering 
Figure 6.6, which represents a (mythical) atom. The nucleus of charge Q is at the coordinate 
origin, and the nucleus is surrounded by an electron cloud of radius a. The electrons are 
assumed to have a uniform density in space. The total electron charge is — Q and so the 
atom is electrically neutral. 



Figure 6.6 Construct needed to discuss polarizability 


When an external uniform electric field is applied, the nucleus is displaced a relative 
distance d in the direction of the field. At this point, the force on the nucleus QE is exactly 
balanced by the force exerted on the nucleus by the electron cloud. 
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According to Gauss’ electrostatic theorem, this is the same force as would be exerted 
if all the charge within a sphere of radius d were concentrated at the centre. This charge 
is —Q times the ratio of the volumes of spheres of radius a and d and so is —Qd 3 /a 3 . This 
gives a force of magnitude 

Ql d 3 1 1 _ Q 2 d 1 

a 3 4jre 0 d 2 4 tcs 0 a 3 

Hence the displacement d satisfies 


Q 2 d 

47T£ 0 « 3 


= QE 


The induced dipole is Qd and the polarizability is Qd/E and 


(6.4) 


Apart from the dimensional factor, the volume of the sphere determines the polarizability. 

Table 6.2 shows the experimental polarizabilities for three inert gas atoms, which 
illustrates the dependence on volume. 


Table 6.2 Polarizabilities of inert gases 


Inert gas 

o 

n 

£ 

He 

0.23 

Ne 

0.44 

Ar 

1.83 


Table 6.3 shows a comparison between typical <a> QSAR model calculations for a 
number of alkanes, together with the result of a serious quantum mechanical calculation. 
The latter are recorded in the ‘Accurate’ column. At the chosen level of quantum mechan¬ 
ical theory (BLYP/6-311-1—|-G(3d,2p) with optimized geometries, to be discussed in later 
chapters), the results are at least as reliable as those that can be deduced from spectroscopic 
experiments. The downside is that the butane calculation took approximately four hours on 
my home PC; the corresponding parameterized calculation took essentially no time at all. 


Table 6.3 Comparison of accurate quantum mechanical calculation with QSAR 


Alkane 

Accurate <a>/10 40 C 2 m 2 J 1 

QSAR <a>/10 -40 C 2 m 2 )” 1 

Methane 

2.831 

2.90 

Ethane 

4.904 

4.94 

Propane 

7.056 

6.99 

Butane 

9.214 

9.02 


It looks at first sight that there is an incremental effect. Adding a CH 2 increases the mean 
polarizability by about 2.13 x 10 -40 C 2 m 2 J -1 and there has been much speculation over the 
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years as to whether a molecular polarizability can generally be written as a sum of atomic 
contributions. 

Many atomic parameter schemes have been proposed, yet there were always exceptions 
that needed a complicated sub-rule in order to get agreement with experiment. Silberstein 
(1917) wrote in his famous paper of 1917 of 

... the ever growing hierarchy of rules indicating how to treat the exceptions to the law of 
additivity, although helpful to the chemist, is the clearest confession of non-additivity. 

The concept of bond polarizability was then introduced, in order to try and circumvent the 
additivity problem. For example, the C-H bond polarizability a CH is taken to be a quarter 
of the methane mean polarizability <a>, and for the alkanes of formula C„H 2 „ +2 we have 


{a (C„H 2 „ +2 )> = (n- 1) (a cc ) + (2n + 2) (a CH ) 


(6.5) 


from which it is possible to deduce a value for a (x ■ This method actually reproduces 
experimental data for hydrocarbons to within a few percent. But once again, such additiv¬ 
ity schemes fail when applied to molecules containing heteroatoms, and to hydrocarbons 
containing atoms in different states of hybridization. 

Miller and Savchik (1979) proposed a method whereby <a> was written as the square 
of a sum of atomic hybrid components. If they had known about 4-quantity electromagnetic 
formulae they would have written their formula 



(6.6) 


but in the event, the 47re 0 is missing from their paper. 

The number of electrons in the molecule is N, and r are parameters for each atom in its 
particular hybrid configuration. The summation is over all atoms in the molecule, and the 
r’s depend on the chemical hybridization but not on the chemical environment; they are 
constant for (for example) an sp 2 carbon atom. 

The authors based their algorithm on an old quantum mechanical treatment of 
dipole polarizability discussed in the classic book by Hirschfelder et al. (1954). The 
Miller-Savchik treatment is widely used in QSAR studies, and to give you a flavour I have 
recalculated the mean polarizabilities and included them in Table 6.3 under the QSAR 
heading. Given that each calculation took just a few seconds, you can see why QSAR 
practitioners think so highly of the Miller-Savchik technique. To put things in perspective, 
we should not be surprised to find such excellent agreement between the two sets of results 
in Table 6.3; it is heterosubstituted molecules that pose the severe test. 

6.4.3 Molecular Volume and Surface Area 

Molecular volumes are often calculated by a numerical integration grid technique that I can 
illustrate by considering the trivial problem of finding the volume of an atom whose van 
der Waals radius is R (the volume is of course *tcR 3 ). 

Figure 6.7 shows a two-dimensional representation of the atom whose van der Waals 
radius is R, surrounded by a three-dimensional grid of equally spaced points. The grid has its 
centre at the atom centre, and the edges of the grid correspond to the van der Waals radius. 
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For each grid point in turn, we calculate its distance from the centre and determine 
whether the grid point lies inside or outside the atom. If n is the total number of grid points 
and n a the number that he within the atom whose volume is V, we have 

V _ w a 

8 

For a polyatomic, we have to give special consideration to grid points that lie in the 
overlap region. Figure 6.8 shows two atoms A and B, with radii R A and R B . The overlap 
region is labelled X. 



For atom A, we know that the volume is \nR\- We now surround atom B with a grid as 
described above, and test each grid point in turn. If the grid point lies within sphere B, we 
test to see if it lies in region X and so has already been counted as part of the volume of 
atom A. The algorithm proceeds until all atoms have been considered in turn. The molecular 
volume is found by adding all atomic contributions. 

There are similar methods for the estimation of molecular surface area. 
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6.4.4 Log(P) 

When an organic solute is in equilibrium with water and a polar solvent such as octan-l-ol, 
it is observed that the ratio of the concentrations 

p [Solute in octan-l-ol] 

[Solute in water] 

is roughly constant. P is called the partition coefficient, and P can be used in predicting 
transmembrane transport properties, protein binding, receptor affinity and pharmacological 
activity. It is easy to determine P experimentally, but in the process of molecular design 
we have to deal with a high throughput of possible molecular structures and so numbers of 
attempts have been made to give simple models for predicting P. 

In studying the effect of structural variations on P, it was suggested that it had an additive- 
constitutive character. In the case of a it -substituent, people made use of Hammett’s ideas 
and wrote 

7T X = log 10 (P X ) - log 10 OP H ) (6.7) 

where P H is the partition coefficient for the parent compound and P x the partition coefficient 
for a molecule where X has been substituted for H. Workers in the field refer to ‘log(P)’ 
rather than ‘P’. 

It was originally envisaged that specific substituents would have the same contribution 
in different molecules. It has been demonstrated, however, that this hoped-for additivity 
does not even hold for many disubstituted benzenes. There are two classical methods for 
estimating log(P), both based on the assumed additivity: the/-constant method of Rekker 
(1976) and the fragment approach of Leo et al. (1975). 

Rekker defined an arbitrary set of terminal fragments using a database of some 1000 
compounds with known log(P). Linear regression was performed, and the regression coef¬ 
ficients designated group contributions. Deviations from the straight fines were corrected by 
the introduction of multiples of a so-called ‘magic factor’ that described special structural 
effects such as polar groups, etc. Log(P) is calculated from the fragmental contributions 
and the correction factors. 

Leo et al. derived their own set of terminal fragments, together with a great number of 
correction factors. 

Klopman and Iroff (1981) seem to be the first authors to make use of quantum mechanical 
molecular structure calculations. They performed calculations at the quantum mechanical 
MINDO/3 level of theory (to be discussed in Chapter 18) in order to calculate the atomic 
charge densities of a set of 61 simple organic molecules. They then developed a linear 
regression model that included the number of C, N, H and O atoms in the given molecule, 
the atomic charges on C, N and O, and certain ‘indicator’ variables n A , n T and n M designed 
to allow for the presence of acid/ester, nitrile and amide functionalities. They found 
log 10 (P) = 0.344 + 0.2078n H + 0.093n c - 2.119n N - 1.937n 0 

- 1.389^ - 17.284 + 0.73164 + 2.844n A + 0.910n T + 1.709n M (6.8) 
The terms involving q 2 represent the interaction of the solute and solvent. 

The method of Klopman and Iroff was a great step forward; there are many fewer 
parameters, it does not produce ambiguous results depending on an arbitrary choice of 
fragment scheme and it does not have a complicated correction scheme, apart from the 
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indicator variables. As we will see in Chapter 18, MINDO/3 calculations can be done very 
quickly even for large molecules. 

Bodor et al. (1989) decided to extend and enhance Klopman’s work by including the 
molecular volume, weight and surface area. In addition, they included the molecular electric 
dipole moment in the correlation, all possible products of pairs of charges and a further 
57 compounds in the analysis. They used a more up-to-date quantum mechanical model, 
AMI (again to be discussed in Chapter 18). The molecular volume and surface area were 
calculated by a grid method. Their equation is 

log 10 {P) = - 1.167 x lO -4 ^ 2 - 6.106 x 10 _2 5 + 14.870 2 - 43.670 + 0.9986/ alkane 

+ 9.57 x 10" 3 M w - 0.13000 - 4.929<2 ON - 12.17G 4 + 26.81£ 2 (6.9) 

- 7.416Q n - 4.551fi 4 + 17.92Q 2 - 4.03<2o + 27.273 

where S (cm 2 ) is the molecular surface area, O the molecular ovality, the indicator 
variable for alkanes (it is 1 if the molecule is an alkane, 0 otherwise), M w the relative 
molar mass, 0 the calculated electric dipole moment and Q 0N the sum of absolute values 
of the atomic charges on nitrogen and oxygen. The remaining Q’s differ from those used 
by Klopman and Iroff in that they are the square root of the sum of charges on the nitrogen 
or oxygen atoms. The most significant parameters are the volume and the surface area; the 
authors claim that this demonstrates that the most important contribution to log(P) is the 
creation of a hole in the structure of water. 


6.5 Problem Set 

A set of problems, together with solutions, is available on the associated website. 
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Sharing Out the Energy 


The structure of matter can be thought about on two different levels. On the one hand, there is 
the macroscopic view where we focus on equations such as the perfect gas equation of state 

pV = nRT (7.1) 

This approach deals with large chunks of matter and involves ideas like temperature, heat 
capacity and pressure that are familiar from everyday life. It is based on measurements 
taken with thermometers, pressure gauges and other instruments that are familiar in an 
ordinary science laboratory. 

On the other hand there is the microscopic approach where we focus on individual atoms 
and molecules. This approach involves rather abstract ideas and tends to be based on theory 
rather than experiment. Atoms and molecules can be studied directly by spectroscopic 
techniques, but they bring their own problems because we usually have to understand 
quantum mechanical principles before we can interpret the experimental findings. 

Another problem is that there are typically 10 23 particles in a macroscopic sample, a 
number that beggars understanding. 

Then again, the concept of temperature does not appear from Newton’s equations, nor 
does it appear in the Schrodinger equation. It is actually a measure of the way that energy 
is shared out between available quantum states, but more of this shortly. 

There must be a link between the two approaches, but what can it be? To be positive, 
things are not as bad as they might seem. What matters in a macroscopic sample are not 
the individual properties of the constituent atoms and molecules, rather their statistical 
properties. It does not matter whether we know which individual particle of 10 23 collides 
with the wall of a container when we want to understand the pressure; what does matter is 
the average behaviour of the 10 23 particles present. 

The two great pioneers of this subject were Ludwig Boltzmann (1844-1906) and Josiah 
Willard Gibbs (1839-1903). In fact they worked independently. Boltzmann’s tomb car¬ 
ries his famous equation relating a thermodynamic property (entropy, S ) to a statistical 
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quantity W. You might have come across Gibbs in a different context as the pioneer of 
modern vector calculus. 

The link between microscopic and macroscopic behaviour is provided by statistics, and 
in this case a special branch of mechanics called statistical mechanics. Statistics deals with 
the collection, organization and analysis of experimental data and it is important that you 
should have the basic concepts at your fingertips before progressing. 

The idea of a random number will appear again and again through the text; a sequence 
of random numbers is one where there is no discernible trend from one number to another. 
Your pocket calculator has a key labelled RAN#; press the key 10 times and make a note 
of the numbers generated. My results are given in Table 7.1; yours will be different. This 
is a series of random numbers; there is no discernible trend over the series and in this case 
they are equally distributed over the range of values 0 to 1. If you switch off your pocket 
calculator, switch it back on again and repeat the experiment, you will almost certainly 
find exactly the same sequence because they are generated numerically from a seed. For 
this reason (amongst others), mathematicians refer to such series as pseudo-random. Such 
numbers are good enough for experiments where random numbers are needed, and the 
sequence can be reproduced time and time again. If you choose a different seed, you will 
get a different sequence. 


Table 7. / Ten pseudo-random numbers from a calculator 


Keypress 123456789 10 

Number 0.861 0.478 0.127 0.427 0.648 0.275 0.571 0.813 0.471 0.063 


The first thing to do is to calculate the mean, which is given by the sum of the set of 
numbers divided by the number of values. If I write x 2 ,..., x n for the numbers then we 
usually write the mean in brackets <x>, given by 

<*) = ^2> (7-2) 

The next thing we might want to know is the spread of points about the average; some¬ 
times the points will be widely spread about the average, sometimes they will be closely 
clustered. For this experiment, the points are chosen at random with values between 0 and 
1, and so the spread is wide. To get a measure of the spread, we calculate the standard 
deviation _ 

CT„ = - ^2 (Xi - {x)f (7.3) 

N ” « 

For reasons based on statistical theory, you will often find the standard deviation calculated 
with a factor of n — 1 instead of n 


N 


(7.4) 
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For large values of n, the two standard deviations are pretty much the same. Using my data 
I found the following: 

(x) = ^ (0.861 + 0.478 + 0.127 + 0.427 + 0.648 + 0.275 + 0.571 + 0.813 + 0.471 + 0.063) 
{.x) = 0.473 o n = 0.252 o n -1 = 0.266 

There are other ways of reporting a spread, for example the median is the value for 
which there are as many values greater than the median as there are values less than it. So 
for example, a company with three employees and a CEO whose hourly rates of pay are 
£4.20, £4.50, £5.00 and £1000 could truthfully quote in their annual report a mean salary 
of £253.43 per hour. The median is £4.75, which tells a different story. 


7.1 Games of Chance 

Let me start the discussion with a very simple experiment. I take a six-faced die whose 
faces have the usual scores S — 1, 2,..., 6 and throw the die 10 times. My results are 
given in Table 7.2. Note that not all the possible scores have appeared. It is easier to work 
with th e frequency of a given score rather than keep track of every single throw; these are 
shown in Figure 7.1. 


Table 7.2 Throwing a single die 10 times 


Throw 1 23456789 10 

Score, S 6 1 233441 34 


Tossing single die 10 times 






• 

























n_ 




CL_ 
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I 
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1 2 3 4 5 6 


Score 


Figure 7.1 Tossing a single die 


Throwing dice can be very exciting and profitable in the right surroundings, but we would 
like a more automated way to generate results. I gave you the hint with RAN#; and I can 
convert the random numbers between 0 and 1 into scores between 1 and 6 by: 
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• multiplying the random number by 6; 

• adding 1; and then 

• taking the integral part of the number 

In MS/EXCEL, you would write = INT(6*RAND() + 1) as the cell formula and copy the 
formula into as many cells as necessary. 

Table 7.3 shows how things change as we increase the number of throws. 


Table 7.3 Frequencies for throwing a single die n times 


Score, S 

n m !0 

n= 100 

n= 1000 

n m 1 000 000 

1 

2 

19 

158 

1 66 689 

2 

1 

17 

178 

166 346 

3 

3 

21 

179 

167326 

4 

3 

16 

170 

1 65 934 

5 

0 

16 

161 

1 66 923 

6 

1 

11 

154 

1 66 782 

<S> 

3.1000 

3.2600 

3.4600 

3.5004 


1.4457 

1.6286 

1.6728 

1.7080 


Again, I can illustrate this as a histogram (shown in Figure 7.2, where I have plotted the 
fractional frequencies). The front set of results refer to n = 100, and so on. You can see 
that the larger the number of throws, the closer the number of occurrences of each score. 

Throwing single die n times 



Score 


Figure 7.2 Throwing a single die n times 


Advanced texts differentiate between the sample mean (samples of 10, 100, 1000 and 
1 000 000 in this case) and the mean of an infinite number of readings called the population 
mean', it should be intuitively obvious that the population mean is 3y 2 and so sample means 
give approximations to population means. For a large number of experiments, the fractional 
frequencies are found to level off at one-sixth the total number of throws, and the probability 
of each score is defined as the limiting value of the fractional frequency for a large number 
of experiments. In this case the probabilities are all equal to 1/6. 
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7.1.1 Two Dice 

Table 7.4 and Figure 7.3 show what happens when we repeat the experiment with two 
dice. In this case the possible scores are 2, 3,..., 12 and this is certainly not the same as 
having a single 12-sided die. To model the experiment, use your random number gener¬ 
ator for each die and add the result; obviously your values will not be exactly the same 
as mine. 


Table 7.4 Throwing two dice n times 


n= 100 n = 1000 n = 1 000 000 


<S> 7.010 6.980 7.003 

cr 2.500 2.357 2.415 


Throwing two dice n times 



1 23456789 10 11 

Score-1 


Figure 7.3 Throwing two dice 


I will tell you that if we toss p dice then the theoretical mean score is p times the mean for 
a single die, and the standard deviation is Jp times. The distribution has obviously settled 
down by the time we get to a million throws, but what has it settled down to? 


7.2 Enumeration 

It should be clear that we can deduce the theoretical probabilities by asking how many 
ways a given event can occur from all allowed possibilities. The probability is given by the 
ratio of the number of times an event can happen to the total number of ways. Table 7.5 
shows the number of ways the two dice can fall in order to make up a given score. The 
theoretical probabilities are 1/36, 2/36,..., 1/36 whilst the mean and standard deviation 
work out as 7 and 2^/2. These probabilities agree completely with the ‘experimental’ values 
shown. 
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Table 7.5 Enumeration for two dice A and B 


Total score, S 

Possible individual 
scores of A and B 

W = number 
of ways 

Probability 

2 

1+1 


1 

1/36 

3 

1 +2, 

2 + 1 

2 

2/36 

4 

1 +3, 

3 + 1,2+2 

3 

3/36 

5 

1 +4, 
3+2 

4 + 1,2+3, 

4 

4/36 

6 

1 +5, 
4 + 2, 

5 + 1,2+4, 

3+3 

5 

5/36 

7 

1 +6, 
5+2, 

6 + 1,2+5, 

3+4, 4 + 3 

6 

6/36 

8 

2 + 6, 
5+3, 

6 + 2, 3 + 5, 

4 + 4 

5 

5/36 

9 

3+6, 
5 + 4 

6+ 3, 4+ 5, 

4 

4/36 

10 

4 + 6, 

6 + 4, 5 + 5 

3 

3/36 

11 

5 + 6, 

6 + 5 

2 

2/36 

12 

Total 

6 + 6 


1 

36 

1/36 

1 


7.3 Boltzmann Probability 

I have mentioned the Boltzmann distribution several times already, so let us now consider 
a very simple macroscopic system comprising a gas of N inert particles at low pressure in 
a thermally insulated constant container of volume V. 

I am going to look ahead and assume that you know about energy quantization and there¬ 
fore assume that the particles have the very simple energy level diagram shown in Figure 7.4; 
each energy level corresponds to a single quantum state, and the allowed energies are 

Sn =nD n= 1,2,3... (7.5) 

I should mention that not all molecular systems show quantized energy; translational motion 
is not quantized either in quantum mechanics or in classical mechanics. It just makes my 
discussion simpler if we take the energy to be quantized. 

Once again, I am assuming that you have come across such diagrams if you have studied 
elementary atomic structure theory, and you might have been expecting me to add electrons 
as up and down arrows (to denote the spin states). The meaning of these diagrams here is 
a little different; the diagram shows the allowed quantum states for each particle and I add 
particle labels to indicate their quantum states as in Figure 7.5. The left-hand arrangement 
of the three particles A, B and C has particles A and B in the lowest quantum state, none in 
the second and C in the third. I have chosen a different arrangement on the right-hand side 
of the figure. 

I labelled the particles A, B and C to distinguish one from another, but there is more to 
this business of distinguishability than meets the eye. Keep this comment in mind as the 
topic will reappear later in the text. 





Sharing Out the Energy 


101 



In a macroscopic sample of matter, each particle can have many microscopic quantum 
states to choose from and a key question is, for an equilibrium temperature, how do the 
particles distribute themselves amongst their allowed quantum states? Let me illustrate the 
problem with just three particles. 

I am going to make another big assumption in order to keep the treatment simple. 
I assume that the particles form a perfect gas and whilst they may collide with each 
other and with the walls of the container, their energies do not change and indeed I can 
find the total energy of the system just by summing over the energies of the individual 
particles. 

For the sake of argument, I will take the internal energy U as 9 D and the number of 
particles N = 3 as in Figure 7.5. We have to enumerate the ways in which three particles 
with these simple quantum state energies can achieve a total energy of 9 D. 

The rows of Table 7.6 indicate every possible combination of integers that make up 9. 
To save space, I have only included one of each possible arrangements after the first; so 
for example, particle A could have 7 D with particles B and C having 1 D, particle B could 
have 7 D with particles A and C having 1 D, or particle C could have 7 D with particles A 
and B having 1 D. This is the origin of the integer 3 in the final column; it is the number of 
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Table 7.6 Arrangements and probabilities 


A 

B 

C 

W = no. of ways 

Probability 

7 D 

1 D 

1 D 



1 

7 

1 

3 

3/28 

1 

1 

7 



6 

2 

1 

6 

6/28 

5 

2 

2 

3 

3/28 

5 

3 

1 

6 

6/28 

4 

4 

1 

3 

3/28 

4 

3 

2 

6 

6/28 

3 

3 

3 

1 

1/28 


ways we can achieve the total energy with each individual distinguishable particle having a 
certain amount of energy. For clarity, I have omitted the ‘D’ from all entries except the first. 

The numbers in the fourth column are obviously related to factorials; if each particle has 
a different energy then there are 3! ways in which we can allocate the energies amongst the 
three particles. If two energies are the same then we have to divide the 3! by 2! and so on. 
If we were to make a spot check of the quantum state occupancy, we would find on average 
a situation where one particle had 7 D and the other two 1 D, for 3/28 of our measurements 
(on the other hand, since this is a quantum system, I had better be a bit more precise and 
say that if we were to prepare a very large number of such systems and measure the energy 
distribution in each copy, we would find 7 D, ID, 1 D in 3/28 of the systems). 

Each arrangement is known as a configuration. 

Consider now a macroscopic system comprising N identical but distinguishable particles, 
each of which has a similar quantum state diagram to that shown in Figure 7.5. The particles 
are noninteracting. We can fix the total energy U and the number of particles, but we cannot 
fix how the energy is distributed amongst the available energy levels (and it does not make 
much sense to ask about the energy distribution amongst N individual particles, given that 
N might be 10 23 ). The most sensible thing is to say that there are /V, particles with energy 
e,, N 2 with energy e 2 and so on. Suppose also that the highest possible quantum state p has 
energy e p . 

Consideration of our simple example above shows that the number of ways in which we 
can allocate the particles to the quantum states is 


(7.6) 


The fact that a particular quantum state may have no particles (or, in other words, it has a 
zero probability) is accounted for by the fact that 0! = 1. 

It turns out that for large N there is one particular configuration that dominates, in which 
case the values of N t are those that maximize W. At first sight they can be found by setting 
the gradient of W to zero, but N, cannot vary freely; they are constrained by the fact that 
the number of particles and the total energy U are both constant 


N = Ni + N 2 + ■ ■ ■ + N p 
U = N i e 1 + N 2 s 2 H-b N p s p 


(7.7) 
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A standard method for tackling constrained variational problems of this kind is 
Lagrange’s method of undetermined multipliers, which is discussed in detail in all the 
advanced texts. I will just outline the solution without justification. 

The first step in our derivation is to take logarithms of either side of expression (7.6): 

In W = lnM - In TV,-! (7.8) 

For large values of n it is appropriate to make use of the Stirling formula for factorials 

= + + + (7.9) 

In practice all terms except n In n are usually omitted and we have 

In IT = In N\ — (N, In N t l - N t ) (7.10) 


We therefore look for a maximum of In W. We know from elementary differential 
calculus that 

dlnW = V^-^dA, = 0 (7.11) 

dN ‘ 

at a stationary point. We cater for the constraints by differentiating Equations (7.7) 

0 = dNi + dN 2 + ■ ■ ■ + dN p 
0 = d NiEi + dN 2 e 2 H-h dA^e^ 

multiplying by arbitrary undetermined multipliers that I will call a and —f and then adding: 

dIn W ~^~ dNi + a m - P e ‘ dN ‘ (7.1 2 ) 


All the dNj can now be treated as independent variables and at a stationary point 


3 In W 

dNi 


p Si = 0 


(7.13) 


All that remains is to use the Stirling formula, and the final result turns out as 

N { = exp (a) exp (—y6e,) (7.14) 

Next we have to evaluate the undetermined multipliers a and P ; if we sum over all the 
allowed quantum states then we have 


N = Y, N, = exp (a) Y ex P (~P s i) 


which gives 

exp (a) = — - 

L ex P (-Pei) 


(7.15) 
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The value of fi is not quite as straightforward. What we need to do is to use the expression 
to calculate a quantity such as the heat capacity of a perfect monatomic gas which can be 
compared with experiment. When this is done we find 



(7.16) 


The formula is correct even when the quantum states are degenerate, as it refers to the 
individual quantum states. It also holds in an equivalent integral form when the energies 
form a continuum. 


7.4 Safety in Numbers 

I should now make a few points about Boltzmann’s law. First of all, the basic result 



can only be an approximation, for if I take a system of 10 000 two-state particles in 
equilibrium at a temperature T = (e 2 — £i )/k B then we find 


Nr 

— = 0.36788 

Ni 


which corresponds to N 2 = 7310.6 and Ni = 2689.4. These estimates cannot be true, since 
the numbers must be integers. The particles inevitably collide with one another and interact 
with the surroundings (technically referred to as the heat bath), and the picture that emerges 
is one of ceaseless activity. The number of particles in each state fluctuates from one moment 
to the next as energy flows into and out of the system via the heat bath. We therefore have 
to understand that the numbers appearing are averages, which I ought to write as 



(7.17) 


In order for Boltzmann’s law to apply, the system must be in thermal equilibrium. The 
particles must be distinct and independent, which means that it must be possible in principle 
to distinguish one particle from another (by colouring, numbering or whatever means we 
want to use). With all of that in mind, I will drop the ‘<.. .>’ notation unless it matters. 

The average number of particles with energy e fluctuates at random; the average popu¬ 
lation is given correctly by Boltzmann’s law but the actual population fluctuates. Do these 
fluctuations matter when we want to predict macroscopic properties? I want to show you 
in this section that they are negligible. 

First of all, let me establish that there is truly safety in numbers. Consider tossing a 
number of coins a certain number of times. It is common in the UK to call one side of the 
coin the ‘head’ side (FI) and the other side the ‘tail’ side (T). In this experiment, I am going 
to throw the coin(s) in the air and record whether the coin lands H or T uppermost. I then 
repeat the experiment (say) 10 times. There is no particular reason for the coin to land either 
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H or T uppermost, and so the theoretical probability of either event is ’/ 2 . Table 7.7 shows 
a typical result. 


Table 7.7 Heads and tails 


Experiment 1 234 5 67 8910 

Outcome TTTHHTHTTT 


If I plot out the results (Figure 7.6) we see that the result is of course unpredictable. We 
cannot say what each result will be. 


Ten throws of coin 


g 0.5- 


1 2 3 


Figure 7.6 Single coin tossed 10 times 


I now increase the number of coins to 10 and repeat the experiment again 10 times. That 
is to say, I take 10 coins and throw them into the air simultaneously. I record the result and 
then repeat the experiment a further 9 times. The results of my experiments are shown in 
Figure 7.7 (which of course I simulated with a random number generator). 


Ten throws of 10 coins 


■llvllllvl 


a 


7 8 9 10 


Throw 

Figure 7.7 Ten coins tossed 10 times 


We would expect an average of 5 heads per throw but we see a significant amount of 
scatter. I now repeat the ten experiments with 100 coins. The results are shown in Figure 7.8, 
and you can see that there is much less scatter about the mean. The point is that there is 
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60 

| 40 
X 20 


Figure 7.8 One hundred coins tossed 10 times 


Ten throws of 100 coins 



1 23456789 10 


safety in numbers; if we use 100 coins we can be pretty sure of getting a result between 40 
and 60 heads. 

To return to the energy problem, because of the very large number of particles that go 
to make up a macroscopic sample, we can be very certain that fluctuations about the mean 
are negligible. 


7.5 Partition Function 


As discussed above, I can write Boltzmann’s law as 

5 exp (-j&) 

The denominator in the Boltzmann expression turns out to be a useful quantity and so it is 
given a special name and symbol: 




Q is the partition function. Some authors use Z rather than Q, and lowercase symbols z 
and q are often encountered when the emphasis is on individual particles rather than the 
macroscopic collection of particles. 

These formulae are correct even when the energy levels are degenerate (i.e. each energy 
level corresponds to a number of quantum states), but of course the sums run over the 
quantum states and not the energy levels. In the above formulae, N } is the number of 
particles in one of the quantum states and many authors choose to write the sums over the 
energy levels. For this purpose, they introduce a degeneracy factor g, for each energy level, 
and so 

AT, *' eXP (-i^) 

S s " ,xp (-^) 


(7.20) 
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It is understood that the sum runs over distinct energy levels and not the individual quantum 
states. I have used the symbol n in the summation to indicate the number of available 
energy levels. 


7.6 Two-Level Quantum System 


You should be familiar with the phenomenon of spin and know that charged particles 
with spin behave essentially as small bar magnets. According to the laws of (classical) 
electromagnetism, any particle with charge Q, mass M and angular momentum 1 has a 
magnetic dipole moment, a vector quantity that is usually written p m : 


Pm 


(7.21) 


The energy of a magnetic dipole in an external field B (Figure 7.9) is given by 


and according to the rules of classical electromagnetism, all orientations of the dipole in 
the field are allowed but each orientation corresponds to a different energy. 



The quantum mechanical picture is quite different; only certain orientations are allowed, 
and the energy is quantized. The Stern-Gerlach experiment demonstrated that electrons 
have an ‘internal’ magnetic moment called electron spin , and only two orientations are pos¬ 
sible. These are visualized as being (roughly) parallel to the field and (roughly) antiparallel 
to the field and the energies are given by 


where m s is the spin quantum number, which takes values of ±y 2 for electron spin. The lower 
energy state e, is therefore that with m s — — y 2 , the higher energy state e u that corresponding 
to m s = +y 2 . 
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Consider then a sample of A such two-state particles. (It is hard to do experiments on 
free electrons, so we had better imagine a sample of hydrogen atoms; the only fly in the 
ointment is that the nuclei also have a spin of 1 / 2 which I am going to ignore.) We keep the 
temperature constant by surrounding the sample with a heat bath and keep the magnetic 
field fixed. Figure 7.10 shows a few of the hydrogen atoms in the heat bath. 


Heat Bath 



The diagram should be pretty obvious: atom A has its electron in the lower spin state, 
atom B has its electron in the higher spin state and so on. The heat bath keeps the temperature 
of our system constant. 

Do not get confused with electronic energy levels, though. If A u and A are the number 
of electrons in the upper and lower spin energy levels and A the total then 


A = A u + A 

and the magnetic internal energy of the system is 

U = N a e n +N l s l 

We also know from the Boltzmann distribution law 


(7.23) 

(7.24) 


A exp 


(a!) 


In order to simplify the algebra, I will take the magnetic energy zero to be £| and the energy 
difference s u — £ } to be D. With this notation, explicit expressions for A u and A are 


D N 


A = 


A 


- _ ) N u = - 

1+exp (“^) 1+exp (“^) 

Figure 7.11 shows how the populations vary with temperature. 
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7.7 Lindemann’s Theory of Melting 

We are going to catch our breath here, and draw together several strands already discussed. 
Figure 7.12 shows a two-dimensional cut through a simple solid. Think of a crystal where 
all the particles are the same, such as argon or possibly a metal. Metals actually get us into 
deep water (so to speak) because they have a sea of free electrons, which is why many 
conduct electricity, but do not worry about this problem for the minute. The important 
thing is that the particles do not appear to interact with each other and they are all the 
same. (Actually, if the particles did not interact they would not form a crystal, they would 
always exist as a perfect gas. That is a matter for discussion later in the text.) The bottom 
right-hand particle is constrained to sit in its equilibrium position by the other particles, 
and we will allow it to make simple harmonic excursions from its equilibrium position 
depending on the temperature. The higher the temperature, the higher are the excursions 
from the equilibrium position. So when does this solid melt? 

The treatment I am going to give is a very simple one that relies on the equipartition 
of energy principle. You will probably know that a molecule comprising N atoms has 3 N 
degrees of freedom, and that we usually think if these as 3 translational, 3 rotational (or 2 
if the molecule is linear) and 3N — 6 vibrational (3 N — 5 if the molecule is linear) degrees 
of freedom. According to the laws of classical physics, each of these degrees of freedom 
has energy of '/ik R T. 






no 
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Figure 7.12 Part of a three-dimensional crystal 


Now back to Figure 7.12. For the minute, assume that the particles only vibrate simple 
harmonically along the horizontal axis, which I will call the x axis. The energy of each 
particle is 

£tot = £ kinetic + ^potential = + 2^ s ^ _ Xe ) 2 (7.27) 

According to classical physics, the average translational energy of any atom or molecule 
is always T per degree of freedom. But, because the particle is vibrating simply har¬ 
monically, the average value of the potential energy is also equal to per degree of 
freedom. In other words 

l -K[{x-x e f)= l -k B T (7.28) 

Taking the square root of both sides gives 

7((*-*e) 2 }= by- (7.29) 

The expression on the left-hand side is the root mean square of x and so 


_ k B T 

Xims ~ K 


(7.30) 


Most authors agree that a solid will melt when x rms achieves a substantial percentage of 
the natural spacing d between the particles in a crystal. Many take 10%, and so 


T meIt = 


k s d 2 

100k B 


(7.31) 


Table 7.8 gives data for five metals, and the agreement between Lindemann’s model and 
experiment is very reasonable for such a simple model. 
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Table 7.8 The Lindemann model for melting 


Solid 

d/pm 

E 

z 

3 1 

T me i,(theory)/K 

T melt (expt)/K 

Copper 

230 

30 

1163 

1356 

Gold 

260 

20 

991 

1336 

Lead 

310 

5 

352 

600 

Silver 

260 

21 

1041 

1234 

Tungsten 

250 

100 

4582 

3650 


7.8 Problem Set 

A set of problems, together with solutions, is available on the associated website. 
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Introduction to Statistical 
Thermodynamics 


I assume that you are familiar with the material presented in Chapter 7. 

Suppose then that we have a macroscopic pure liquid sample, which might consist of 10 23 
particles, and we want to try and model some simple thermodynamic properties like the 
pressure p, the internal energy U or the Gibbs energy G. At room temperature, the individual 
particles making up the sample will be in motion so at first sight we ought to try and solve 
the equations of motion for these particles. In view of the large number of particles present, 
such an approach would be folly. Just to try and specify the initial positions and momenta 
of so many particles would not be possible, and in any case such a calculation would give 
too much information. You will know from Chapter 7 that the correct approach has to be a 
statistical one. 

For the sake of argument, suppose that the container is a cube. I show a two-dimensional 
slice through the cube on the left-hand side of Figure 8.1, and I have exaggerated the size 
of the particles by a factor of approximately 10 10 . 

The pressure exerted by a gas on a container wall depends on the rate at which particles 
collide with the wall. It is not necessary, or even helpful, to know which particle underwent 
a particular collision. What we need to know about are the root mean square speed of the 




Figure 8.7 Box of particles 
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particles, their standard deviation about the mean, the temperature and so on. In statistical 
thermodynamics, we do not enquire about the behaviour of the individual particles that 
make up a macroscopic sample; we just enquire about their average properties. 

If we were to measure the pressure exerted on the walls at time intervals t\,t 2 ,...,t„ then 
we might record results p{ti),p{t 2 ),..., p(t„). We could calculate a sample mean <p> and 
a sample standard deviation using these results: 

ip) = \ 


' X>tt)-(P» 2 (8.1) 

n " 

We might expect that the greater the number of measurements, the closer the sample mean 
would be to the true mean, and the smaller the sample deviation would become. 



8.1 The Ensemble 

When we considered Figure 8.1, I was careful to draw your attention to the differ¬ 
ence between particle properties and bulk properties. I also mentioned that classical 
thermodynamics is essentially particle-free; all that really matters to such a thermody- 
namicist are bulk properties such as the number of particles N, the temperature T and the 
volume of the container V. I have represented this information in the right-hand box in 
Figure 8.1. 

Rather than worry about the time development of the particles in the left-hand box in 
Figure 8.1, what we do is to make a very large number of copies of the system on the 
right-hand side. We then calculate average values over this large number of replications 
and, according to the ergodic theorem , the average value we calculate is exactly the same as 
the time average we would calculate by studying the time evolution of the original system. 
The two are the same. 

I am not suggesting that all the cells in the ensemble are exact replicas at the molecu¬ 
lar level: all we do is to ensure that each cell has a certain number of thermodynamic 
properties that are the same. There is no mention of molecular properties at this stage of 
the game. 

Figure 8.2 shows an ensemble of cells all with the same values of N, V and T. 

This array of cells is said to form a canonical ensemble. There are three other important 
ensembles in the theory of statistical thermodynamics, and they are classified according to 
what is kept constant in each cell. Apart from the canonical ensemble, where N, V and T 
are kept constant, we have the following. 

• The microcanonical ensemble where N, the total energy E and V are kept constant in 
each cell. In fact, this is a very simple ensemble because energy cannot flow from one 
cell to another. 

• In an isothermal-isobaric ensemble, N, T and the pressure p are kept constant. 
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• Finally we have the grand canonical ensemble, where V, T and the chemical potential 
are kept constant. The grand canonical ensemble is a fascinating one because the number 
of particles is allowed to fluctuate. 


N, V ; T 

N, V, T 

N, V , T 

N, l/, T 

N, V , T 

N, V, T 

N , V, T 

N ; V, T 

N, V , T 

N, V, T 

N, V, T 

N, V, T 


Figure 8.2 Canonical ensemble 

Supposing then that we consider a canonical ensemble of N* cells (not o be confused 
with the N particles) comprising the original cell together with N* — 1 replications. Energy 
may flow between the cells, but the total energy of the ensemble is constant. Suppose 
that the possible total energies of the N particles contained in each cell are E*, E*, and so 
on. We take an energy snapshot, and find a distribution of energies amongst the cells as 
follows: 

N* cells have energy E* 

/V 2 * cells have energy E*, etc. 

According to Boltzmann, E* and N* are related by 



Be careful to understand that the energies E* are not molecular energies; they are the total 
energies of the collection of the N particles contained in each cell. Also note that N* is the 
number of cells in the ensemble, and that the energies are taken as relative to a common 
arbitrary zero. 

The denominator in the expression above plays an important role in our theory, and so it 
is given a special name and symbol: 

e = E ex p(-j^) ( 8 - 3 ) 

Q is (in this case) the canonical partition function, and it can be used to calculate the usual 
chemical thermodynamic functions as follows. 
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8.2 Internal Energy, U th 

The IUPAC recommended symbol for thermodynamic internal energy is U, but I have added 
a subscript ‘th’ for ‘thermodynamic’ so that there is no confusion with the total potential 
energy of a system U (often written i> ). Internal energy is obviously related to the ensemble 
energy average but we have to exercise caution. Chemical measurements only give changes 
in the internal energy, not absolute values. I will therefore write the internal energy as 
U& — U Q , where U 0 is an arbitrary constant. For most purposes we can take U 0 to be zero. 
We have, for the ensemble of N* members 


EN*E* 



(8.4) 


and according to the ergodic theorem, this is equal to the time average of U& — U 0 for any 
one cell. Using the Boltzmann expression we have 



Q 


(8.5) 


I can tidy up Equation (8.5) by noting 



(8.6) 


and so on substitution 



(8.7) 


8.3 Helmholtz Energy, A 


From the definition of A we have the following: 

A = U — TS 




( 8 . 8 ) 


A little manipulation gives 



A — A 0 = -k B T\nQ 


(8.9) 


Again, the arbitrary constant A 0 can be taken as zero, since only changes in A are ever 
measured. 



8.4 Entropy S 

Finally, since 
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we have 



( 8 . 10 ) 


8.5 Equation of State and Pressure 


The pressure is related to the Helmholtz energy by 


P = ~ 



and so we find 




( 8 . 11 ) 


This equation is sometimes called the equation of state. The enthalpy and the Gibbs energy 
can be derived using similar arguments. They turn out to be 




( 8 . 12 ) 


8.6 Phase Space 

Sophisticated methods such as those due to Hamilton and to Lagrange exist for the system¬ 
atic treatment of problems in particle dynamics. Such techniques make use of generalized 
coordinates (written q u q 2 ,..., q„) and generalized momenta (written p u p 2 ,..., p„); in 
Hamilton’s method we write the total energy as the Hamiltonian H. H is the sum of the 
kinetic energy and the potential energy, and it is a constant provided that the potentials are 
time independent. H has to be written in terms of the p’s and the q’s in a certain way, and 
systematic application of Hamilton’s equations gives a set of differential equations for the 
system. 

To fix our ideas, consider the particle of mass m undergoing simple harmonic motion as 
discussed in Chapter 4. In this one-dimensional problem I wrote the potential as 


U = -K (R - KY 
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so that the total energy is 


1 /dR\ 2 1 

S =2 m U) + 2 MR - K - ) ' 


If I put q = R — R e then the momentum p is mdR/dt and I can write the Hamiltonian as 


We say that the particle moves through phase space and in this example the trajectory 
through phase space is an ellipse (Figure 8.3), which can be easily seen by rewriting 
Equation (8.13) as 



P 


Figure 8.3 Phase space 


Hamilton’s equations of motion are 


d q, _ 8H dp, 
dt dpi ’ df 


dH 

Hi 


(8.14) 


so for a general problem with N atoms, we have to solve 6N first-order differential equations 
rather than the 3N second-order differential equations we would get from straightforward 
application of Newton’s law. 

In the case of a one-particle three-dimensional system, the Hamiltonian will be a function 
of the three coordinates q and the three momenta p, and for a more general problem 
involving N particles the Hamiltonian will be a function of the 3 N q’s and the 3 N p’s. We 
say that the p’s and the q’s together determine a point in 6/V-dimensional phase space, and 
this point is often denoted r. 
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8.7 Configurational Integral 


Returning now to the canonical partition function, Equation (8.3) 




the first thing to note is that all points in phase space contribute to the sum, and the summation 
has to be replaced by an integral. For an ideal monatomic gas the expression becomes 



(8.15) 


The equation is often written with the Hamiltonian H replacing E, for the reasons discussed 
above. 

The TV! term is needed in situations where the particles are completely indistinguishable 
from one another; for particles that can be distinguished there is no TV! term. The integrals 
have to be done over the spatial variables of all the TV particles, and also the momentum 
variables of the TV particles. The integral is therefore a 6TV-dimensional one. 

The energy (the Hamiltonian) is always expressible as a sum of kinetic and potential 
energies, and I have written the mass of each particle in: 



(8.16) 


Kinetic energies depend on the momentum coordinates p. All the potential energies we 
will meet depend on the spatial coordinates q but not on the momenta and so the partition 
function can be factorized into a product of a kinetic part and a potential part: 



The kinetic integral has to be done over the momentum coordinates of all TV particles, and 
it can be seen to be a product of TV identical three-dimensional integrals of the type 



Each of these is a product of three identical standard integrals of the type 



and the final result is 



(8.18) 


The 3TV-dimensional integral over the position variables is often referred to as the con¬ 
figurational integral. For an ideal gas 0=0 and so the configurational integral is V N , 
where V is the volume of the container. Some authors include TV! in the definition of the 
configurational integral. 
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The canonical partition function for an ideal gas is therefore 


V N (2jrmk H T\ 3N 
2= T7T -15- 


(8.19) 


The partition function for a real system is often written as the product of an ideal part and 
an excess part due to nonideal behaviour: 

Q — <2 ldeal <2 excess where 


2 excess = ^ / ex P (-JTt) dq (8 ’ 20) 

The point of doing this is that thermodynamic properties such as A are often measured 
experimentally as an ideal and an excess part: 

A = A ideal +A excess 


The ideal part can be related to <2' dcal and the excess part to <2 excess - 


8.8 Virial of Clausius 


Let me focus attention on one particular particle i moving in the box shown in Figure 8.1. 
As this particle moves it will be subject to some varying force F, and 
dv, 

F ‘ = m ~di (8 ' 21) 

Taking the scalar product of both side of this equation with r, we get 
r,.F, - 


-( 2 ) 


Consider now the vector identity 


which can also be written 


On comparison of Equations (8.22) and (8.24), we have 


( 8 . 22 ) 

(8.23) 

(8.24) 

(8.25) 


The next step is to sum corresponding terms on both sides of the equation for each particle 
in the box. For N particles each of mass m, this gives 

rF = --m— Y"r v + YV 

2 2 2 1 


(8.27) 
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Finally, we take a time average over all the particles in the box, which is assumed to be in 
an equilibrium state: 


2 




(8.28) 


The second term on the right-hand side is obviously the mean kinetic energy of all the 
particles in the box. This must be \Nk B T, according to the equipartition of energy principle. 

Whatever the value of the first average quantity in brackets on the right-hand side it 
cannot vary with time because we are dealing with an equilibrium state and so the first time 
derivative must vanish: 



Thus we have 



(8.29) 


The summation term on the left hand side 

involving the forces and coordinates is often referred to as the virial of Clausius. According 
to the equipartition of energy principle, it is equal to \Nk^T. 
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Monte Carlo Simulations 


When discussing QSAR in Chapter 6,1 mentioned that the molecular volume was thought 
to be a useful index and explained how it could be calculated within the framework of a 
MM study. 

To recap, Figure 9.1 shows a diatomic molecule, where the atomic radii are R A and R n . 
The problem is to find the volume of the molecule AB. The volume of atom A is ^ttR a and 
the volume of atom B is \tcR\ . The volume of AB is of course less than or equal to the sum 
of these atomic volumes depending on the extent to which the atoms overlap. We have to 
be careful not to count the shared volume X twice in the calculation. 



To expand on the discussion in Chapter 6,1 suggest that the total can be calculated using 
the following algorithm. 
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1. Start with A, for which the volume is |tt R 3 a . 

2. Surround B with a regular grid of side 2f? B , and examine which grid points lie inside B 
but exclude those that he in region X. To do this, test if they are within distance R A of 
nucleus A. 

3. Also exclude points which lie outside atom B. 

4. A simple ratio of points then gives the contribution to the molecular volume from atom B. 

The process can be easily extended to any polyatomic system. This is a pretty simple 
problem; suppose there are N atoms and we take a grid of m one-dimensional points for 
each atom. This means that we will have to consider (N — 1) x m 3 points in space. If 
N = 101 and m = 50 the number is 12.5 x 10 6 ; a large number of thankfully very simple 
calculations. 

Rather than take a regularly spaced cubic array of points around atom B, we can surround 
atom B by an (imaginary) cube of side 2 R B and choose points at random inside this cube. 
Here is the revised algorithm. 

1. Choose a point at random and examine whether it lies inside atom A, in which case we 
accept it. 

2. If the point lies inside atom B and not in region X, we accept it. 

3. Increment the number of points and return to 1. 

A simple proportionality between the number of successful points and the total number 
tried gives the required volume. 

This calculation is an example of the Monte Carlo technique (denoted MC in this text). 
MC is a generic term applied to calculations that involve the use of random numbers for 
sampling; it became widely used towards the end of the Second World War by physicists 
trying to study the diffusion of neutrons in fissionable material. 

In fact, the MC method is usually attributed to a French eighteenth-century naturalist 
named Buff on, who discovered that if a needle of length l were dropped at random onto a 
floor consisting of wooden planks with uniform separation d > l, then the probability of 
the needle crossing the join between two planks is 21/nd. Buffon’s subsequent experiments 
enabled him to make an accurate estimation of it. 

In the case of the molecular volume calculation, the problem scales as the number 
of atoms. That is good news. We can easily identify worse problems; for example, the 
trapezium rule enables us to approximate a definite integral 


j fix) dx = ^d(f (a) + 2f (a + d)) + ■ ■ ■ +f (b)) 


as the sum of trapezia shown in Figure 9.2. If we choose n points along the x axis then we 
have to evaluate the function (n + 1) times in order to evaluate the sum. 

For the corresponding two-dimensional integral 



we would have to make (n+ l) 2 function evaluations and the number of function evaluations 
is seen to rise as the power of the dimension of the integral. That is bad news. 
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f(x) 



b 


x 


Figure 9.2 Trapezium rule for a definite integral 


9.1 An Early Paper 

Metropolis and Ulam’s (1949) keynote paper has an interesting abstract. 

We shall present here the motivation and a general description of a method dealing with a class 
of problems in mathematical physics. The method is, essentially, a statistical approach to the 
study of differential equations, or more generally, of integro-differential equations that occur 
in various branches of the natural sciences. 

and there is much to be gained by studying the opening sentences... 

Already in the nineteenth century a sharp distinction began to appear between two dif¬ 
ferent mathematical methods for treating physical phenomena. Problems involving only a 
few particles were studied in classical mechanics, through the study of systems of ordinary 
differential equations. For the description of systems with very many particles, an entirely dif¬ 
ferent technique was used, namely, the method of statistical mechanics. In this latter approach, 
one does not concentrate on the individual particles but studies the properties of sets of 
particles. 

Straightforward thermodynamic quantities such as the pressure, the internal energy and 
the Gibbs energy turn out to be impossible to calculate directly for a macroscopic system, 
simply because of the large number of particles involved. In fact, it is not even sensible to 
contemplate recording an initial starting point for 10 23 particles, let alone devising methods 
for solving the equations of motion. 

Suppose for the sake of argument that we have a system of N simple atomic particles (such 
as argon atoms) and we are interested in the total mutual potential energy (conventionally 
written <t> rather than U). If the pair potential is Ujj(R) and the potentials are pair wise 
additive, we have 



If I denote the position vectors of the N particles measured at time t by R A (f), Rr( 0, ■ • •, 
R„(t), then these position vectors will depend on time and the instantaneous value of 0 
will depend on the values of the variables at that time. If we make enough measurements 
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0(t 2 ), ■ ■., 0{t„) at times t u then the average of these measurements will 

approach the mean value of 0 : 



Other properties such as the self-diffusion depend on fluctuations about mean values. 

In statistical thermodynamics, rather than calculating such a time average we consider 
the average over a large number of replications of the system (an ensemble). The ergodic 
theorem tells us that the two are the same. 

In Chapter 8 I considered the special case of a canonical ensemble, which consists of 
a large number of replications (or cells), each of which are identical in the sense that the 
number of particles N, the volume V and the temperature T are the same. The cells are not 
identical at the atomic level, all that matters is N, V and T. Energy can flow from one cell 
to another, but the total energy of the ensemble is constant. 

Under these conditions, the chance of finding a cell with energy E is proportional to the 
Boltzmann factor exp(—E/k B T). Mean values of quantities such as the energy <E> can 
be found by averaging over the replications 



(9.1) 


or for an infinite number of replications we would replace the sums by integrals 




<£> = 



Provided the potential energies do not depend on momenta, it is possible to simplify 
such integrals and I explained earlier the importance of the configurational integral, which 
depends only on exponential terms such as 



(9.2) 


involving the total mutual potential energy 0 .1 also showed how it could be related to the 
so-called excess thermodynamic functions. 

I have written the configurational integral in a simplified way; if there are N particles, it is 
actually a 3/V-dimensional integral over the positions of the N particles. We might want to 
take N — 10 000 particles for a very simple simulation but the considerations above show 
that it is impractical to carry out such a multidimensional integral by usual techniques of 
numerical analysis such as the trapezoid rule. Instead we resort to the Monte Carlo method 
and generate a representative number of points in conformation space. For each point we 
calculate a value of 0. The integral is approximated by a finite sum. 
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The crucial point is that there is no momentum contribution to the partition function 
in MC. The momentum contribution is essentially what we would expect for a system of 
noninteracting particles. 


9.2 The First ‘Chemical’ Monte Carlo Simulation 

The first serious Monte Carlo calculation appears to be that of Metropolis et al. (1953) 
(although the method was proposed independently by J.E. Meyer and by S. Ulam). In 
fact, the Metropolis et al. paper marked the birth of computer simulation as a statistical 
mechanical technique, and it preceded the molecular dynamics studies to be discussed in 
Chapter 10.1 cannot do better than repeat the words of the authors (i.e. quote their abstract): 

A general method, suitable for fast computing machines, for investigating such properties as 
equations of state for substances consisting of interacting individual molecules is described. 
The method consists of a modified Monte Carlo integration over configuration space. Results 
for the two-dimensional rigid-sphere system have been obtained on the Los Alamos MANIAC, 
and are presented here. These results are compared to the free volume equation of state and to 
a four-term virial coefficient expansion. 

The rigid-sphere potential shown in Figure 9.3 is the simplest one imaginable; the spheres 
cannot overlap, but their mutual potential energy is zero everywhere else. 


£ 


0 


Distance, R 


Figure 9.3 Rigid sphere potential 

The authors took a two-dimensional square array of 224 hard spheres (i.e. hard discs ) 
each with fixed radius o. Here is the algorithm. 

1. A starting configuration is chosen. Metropolis placed the 224 discs in a triangular lattice 
of 14 particles per row by 16 particles per column, alternate rows being displaced by 0.5cr. 

2. Each disc is moved in succession in the XY plane according to the formula 

X^X + a^ 

Y^Y + ab 
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where a is some (arbitrary) maximum allowed distance and are random numbers 
between — 1 and +1. This means that after we move a particle, it is equally likely to be 
anywhere within a square of side 2er centred about its original position. 

3. At the end of the moves, the discs are examined for overlap. In the case that the move 
did indeed put one disc on top of another disc, the particles are returned to their original 
positions. This is a primitive form of the technique known as importance sampling that 
is now a key feature of modern MC simulations. 

4. We then calculate any useful data for averaging. In this model, 0 — 0 for all allowed 
conformations, but we might be interested for example in the disc-disc separations in 
order to investigate the radial distribution functions to be discussed in later chapters. 

5. After a large number of allowed configurations have been examined (and the calculation 
has lost all knowledge of its starting configuration), we stop and do the final averaging. 
The choice of a is actually important here. If a is large then discs invariably overlap 
and if a is small then the conformations do not change much and in either case we have 
to perform a very large number of iterations to get a meaningful average. It is normally 
chosen so that 50% of the trial moves are accepted. 

6. Finally the acid test: compare with experiment. It is rather difficult to think of a real-life 
system that could be realistically represented by the rigid-sphere model but the authors 
did their best; they used the virial theorem of Clausius to derive an equation of state for 
the rigid discs 



where <«> is the average density of other particles at the surface of a particle and A 
the area. They then calculated <n> from the MC simulation and compared with the 
equation of state. 


9.3 Importance Sampling 

In the case of a more complicated problem (even a modest Lennard-Jones potential), a 
more sophisticated treatment of importance sampling is needed for the following reason. 
The average potential energy can indeed be obtained from the configuration integral 



(9.3) 


and each integral can indeed be approximated by a finite sum of M terms in MC 



(9.4) 
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If we calculate the mutual potential energy i>,(q) for a random array of particles, then the 
array will have a Boltzmann weight of 



and if some particles are close together 0, will be large and the Boltzmann factor small. Most 
of phase space corresponds to nonphysical configurations with very high energies and only 
for a very small proportion of phase space does the Boltzmann factor have an appreciable 
value. This means that most of the computing time will be spent on configurations that give 
negligible contributions to properties of interest. 

Metropolis et al. (1953) therefore proposed a modified MC scheme where, instead of 
choosing configurations randomly then weighting them with exp(— 0 /k n T), configurations 
were chosen with a probability distribution of exp(— 0/k B T) and then weighted equally. 
This adds a branch to the algorithm as follows. 

1. Before making a move, we calculate the energy change of the system caused by the 
move. If the energy change is negative, the move is allowed. 

2. If the energy change is positive, we allow the move but with a probability of 
exp(—0/k B Ty, to do this, we generate a random number £3 between 0 and 1, and if 



we allow the move to take place. 

3. Otherwise we leave the particle in its old position and move on to the next particle. 

4. Whether the move is allowed or not, we consider that we have generated a new 
configuration for the purposes of taking averages. 

This summarizes the Metropolis Monte Carlo (MMC) technique. The crucial feature of 
MMC is that it biases the generation of configurations towards those that make significant 
contributions to the configuration integral. It generates configurations with a probability 



and then counts each of them equally. By contrast the simple MC integration method 
generates states with equal probability and then assigns them this weight. 


9.4 Periodic Box 

Figure 9.4 shows a box of argon atoms, and examination reveals two problems. 

1. Atoms near the edges of the box will experience quite different resultant forces from 
those atoms near the centre of the box. 

2. Second, a MC move may well place one of the atoms outside the box and this will alter 
the density of the system under study. 


The periodic box concept, illustrated in Figure 9.5, gives a solution to these problems. 
We appeal to the ensemble concept of statistical thermodynamics, and surround our system 
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Figure 9.5 Small part of the molecular ensemble 
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with a large number of identical copies. In this case the boxes are truly identical at the atomic 
level rather than in the usual thermodynamic sense of having N, V and T in common. 

Figure 9.5 shows a two-dimensional shoe through a small portion of the system (the 
central box where the atoms are shown grey) and the copies (where the atoms are shown 
black). Each copy is identical at the atomic level, and each corresponding atom undergoes 
the same MC change as its image in every other copy. 

If a MC move causes the dark grey atom (top left, central cell) to leave the central cell, 
its image enters from an adjoining copy, shown by the vector displacements in the figure. 
This keeps the density constant. 

There are no effects due to the walls because each atom in the central cell is under the 
influence of every other atom in the central cell and in all other cells. 


9.5 Cut-Offs 

This means we have to be able to calculate the mutual potential energy of (for example) the 
dark grey atom with all the others shown in Figure 9.5. This would give an infinite sum. 

In the case of a short-range potential such as the Lennard-Jones, we simply decide on 
a cut-off distance R c beyond which the pair potential will be negligible; this defines a 
sphere of radius R c centred on each atom. In order to treat the dark grey atom, we just 
include contributions from all other atoms in the sphere as shown in two dimensions in 
Figure 9.6. 



Figure 9.6 The cut-off distance 
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Truncation of the intermolecular potential at a cut-off distance actually introduces two 
technical difficulties. Firstly, the pair potential has a discontinuity at the cut-off distance 
Rc, and, secondly, whenever a pair of particles A and B has separation greater than R c the 
total energy is not conserved. The first problem is solved by shifting the potential function 
by an amount U(Rc)', that is, we take 


{ UQIab)- U(R c ) if R Aft < R c 
{ 0 if Rab > R c 


(9.5) 


The second problem can be solved by adding a small linear term to the potential, chosen 
so that its derivative is zero at the cut-off distance 

V- <*„) = j v (s “>- u <s '> - (£), c <fi “ - *=> ““ 2 * 
l o C ifRAB > R c 


9.6 MC Simulation of Rigid Molecules 

For rigid nonspherical molecules, it is necessary to vary their orientation as well as the 
position in space; usually each molecule is translated and rotated by random amounts once 
per cycle. The simplest approach is to rotate about each Cartesian axis in turn by an angle 
chosen at random but subject to a suitable maximum allowed variation. For example, if the 
position vector of an atom is the column vector R, the position vector after rotation by a 
about the x axis is given by 

/! ° 0 \ 

R = I 0 cos a sin a I R (9.6) 

Y 0 — sin a cos a J 

The three Euler angles 0, 6 and i Jr are often used when discussing rotational motion. 
First of all we rotate by 0 about the Cartesian z axis. This changes the x and y axes to x! 

and y 1 as shown in Figure 9.7. Then we rotate by 9 about the new x axis which changes 



Figure 9.7 First Euler angle 
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the y and z axes. Finally we rotate by i/r about the new z axis. The relationship between a 
position vector before and after the three rotations is 


( cos </> cos i/r — sin </> cos 0 sin i/r 
- cos </> sin xl/ - sin 0 cos 0 cos ^ 


sin </> cos xjr + cos cj> cos 0 sin if/ 
— sin </> sin i/r + cos </> cos 0 cos xjr 
— cos 0 sin 0 


sin 0 sin f \ 
sin 0 cos i/f R 

COS0 / 


(9.7) 


There are two technical problems with the use of Euler angles. First, sampling the angles 
at random does not give a uniform distribution; it is necessary to sample i/r, cos 6 and \js. 
Second, there are a total of six trigonometric function evaluations per rotation. 

A preferred alternative makes use of quaternions, which are four-dimensional unit 
vectors. A quaternion q is usually written in terms of the scalar quantities q 0 , q\, qi, <y 3 as 

q = ( qo qi ?2 q 3 ) T 

and the components satisfy 

ql + q\ + q 2 2 + q\ = 1 


They can be related to the Euler angles as 

q 0 = cos \Q cos | (<p + VO 
q l = sin f 9 cos f (<j> — VO 
q 2 = sin \6 sin f (<j> — \ /r) 
q 3 = cos \0 sin f (<j) + \jr) 

The rotation matrix can be written as 

( ql + ql - ql - ql 2 (^ 2 + <m 3 ) 2(^,^ 3 - q 0 q 2 ) \ 

2 ( 41^2 - q 0 q 3 ) q% - q 2 i + ql - ql 2(q 2 q 3 + q 0 qi) JR 

2 (q iq 3 + q 0 q 2 ) 2 (q 2 q 3 - q 0 q,) q" 0 - q] -q 2 2 + q\ ) 

and all that is necessary is to generate four suitable random numbers. 


(9.8) 


(9.9) 


9.7 Flexible Molecules 

Monte Carlo simulations of flexible molecules are difficult to perform unless the system 
is small or some of the internal degrees of freedom are kept fixed. The simplest way to 
generate a new configuration is to perform random changes to the Cartesian coordinates of 
individual atoms in the molecule but it is a common experience that very small changes 
are needed in order to produce an acceptable Boltzmann factor (in the MMC sense). 
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Molecular mechanics (MM) tends these days to be concerned only with prediction of local 
minima on molecular potential energy surfaces. QSAR properties are often calculated 
in order to assist high-volume screening studies in pharmaceuticals applications. Should 
we want to study the motions of the molecule, all that would be needed would be to 
investigate the normal modes of vibration (which can be obtained from the Hessian). MM 
does not take account of zero-point vibrations and the calculations refer to a molecule at 
0 K, when it is completely at rest. Workers in the modelling field often refer to MM as 
energy minimization. 

As we saw in Chapter 9, Monte Carlo modelling tends to be concerned with thermody¬ 
namic properties. It is now time to consider molecular dynamics which gives a direct way 
to study time evolution. 

I have stressed in previous chapters the intermolecular potential energy, which I chose 
to write as 0 in Chapter 9. Assuming pair wise additivity, 0 can be found by summing 
over all distinct pairs of particles 


®=x;x>. 


( 10 . 1 ) 


If the assumption of pair wise additivity is not valid, then one way forward is to include all 
possible triples and so on 



( 10 . 2 ) 
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In this text, we will generally be concerned with situations where the potentials (and so 
the forces) are pair wise additive. If we focus on one particular particle A, then the mutual 
potential energy of A with all the other particles U A is found from 

Uk = Y, u n 

MA 

and we can find the force on particle A, F A , by differentiating U A with respect to the 
coordinates of particle A: 

F a = —grad U A 

For example, if we consider a pair of Lennard-Jones particles A and B where 

«—((=)'-(£)) 

we note that the potential only depends on the distance between the particles. The expression 
for grad U is particularly simple in this case so that 
31/Rba 
dR A R ba 

which gives 

Newton’s second law connects force and acceleration by 


and in principle we could study the time development of a system by solving this second- 
order differential equation, one such equation for each of the particles in our system. 
Calculating the trajectories of N particles therefore appears to involve the solution of a set 
of 3 N second-order ordinary differential equations. Alternatively, we could use an advanced 
method such as Hamilton’s to solve 6 N first-order differential equations. For any set of N 
particles it is always possible to find three coordinates that correspond to translation of 
the centre of mass of the system, and, if the particles have ‘shape’, three coordinates that 
correspond to rotations about three axes that pass through the centre of mass. 

Most of the early molecular dynamics studies were directed at the problem of liquid 
structure, so that is where we will begin our discussion. 


10.1 Radial Distribution Function 

Of the three states of matter, gases are the easiest to model because the constituent particles 
are so far apart on average that we can ignore intermolecular interactions apart from during 
their brief collisions. This is why researchers were able to bring the kinetic theory of gases 
to such an advanced stage by the end of the nineteenth century (and before the existence 
of a satisfactory theory of molecular structure). 
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The atoms or molecules in a crystalline solid are arranged in a regular order, and for this 
reason we usually start a discussion of the solid state from the properties of regular solids. 
Once such patterns were truly understood at the beginning of the twentieth century, the 
theory of the solid state made rapid progress. 

Liquids are much harder to model and to study experimentally than solids and gases; 
elementary textbooks usually state that liquids show neither complete order nor complete 
disorder. The basis of this remark concerns a property called the radial distribution function 
g(r). Consider Figure 10.1, which is a snapshot of the particles in a simple atomic liquid. 



Figure 10.1 Radial distribution function 

We take a typical atom (the grey one, designated i) and draw two spheres of radii r and 
r + dr. We then count the number of atoms whose centres lie between these two spheres, 
and repeat the process for a large number N of atoms. If the result for atom i is g, (r)dr then 
the radial distribution function is defined as 

g (r) dr = ^ gi (r) dr (10.3) 

This process then has to be repeated for many complete shells over the range of values of 
r thought to be significant. 

In the case of an ideal gas, we would expect to find the number of particles to be propor¬ 
tional to the volume enclosed by the two spheres, which is 47rr 2 dr. This gives g(r) = 4rcr 2 , 
a simple quadratic curve. 

Consider now the simple cubic solid shown in Figure 10.2 whose nearest neighbour 
distance is a. Each atom is surrounded by 6 nearest neighbours at a distance a, 12 at a 
distance s[2a, 8 next-next nearest neighbours at a distance \/3 a, 6 at a further distance 2a 
and so on. We would therefore expect to find a radial distribution function similar to the 
one shown in Figure 10.3. The height of each peak is proportional to the number of atoms 
distant r from any given atom. 
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Figure 10.3 First part of the radial distribution function for a simple solid 
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Radial distribution functions can be deduced experimentally from diffraction studies. 
In the case of a liquid (Figure 10.4), the curve resembles that expected for a solid at low 
temperatures, and at high temperatures it resembles the quadratic expected for an ideal gas. 
At intermediate temperatures, the two features can be clearly seen; essentially a solid pattern 
is superimposed on the gas pattern. This gives the experimental basis for the well-known 
remark about liquid structure quoted above. 



Figure 10.4 Radial distribution function for a liquid superimposed on an ideal gas 


10.2 Pair Correlation Functions 

The radial distribution function for a gas varies as 4m- 1 and so tends to infinity as r tends to 
infinity. It is usual to remove the 4nr 2 dependence by defining a related quantity called the 
pair correlation function g/^lr), which gives information about the probability of finding 
two particles A and B separated by distance r. If the volume of a system is V and it contains 
N a species of type A and N B species of type B, then the number densities are NJV and 
N B /V. The fraction of time that the differential volume elements dri and dr 2 , which are 
separated by distance r, simultaneously contain species of type A and B is given by 
N a N b 

— —£ab (r)dridt 2 

In a mixture of A and B we would be interested in the three distinct pair correlation 
functions gAA(r), ^bbCt) and gAn(r). These pair correlation functions have a limiting value 
of 1 for a fluid. 
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10.3 Molecular Dynamics Methodology 

In an ideal gas, the particles do not interact with each other and so the potential <P is zero. 
Deviations from ideality are due to the interparticle potential, and most of the early studies 
were made on just three types of particle: the hard sphere model discussed in Chapter 9, 
the finite square well and the Lennard-Jones model. 


10.3.1 Hard Sphere Potential 

Just to remind you, the hard sphere potential is the simplest one imaginable; the system 
consists of spheres of radii o and U AR (J?ab) is zero everywhere except when the two spheres 
A and B touch, when it becomes infinite. 

The hard sphere potential is of great theoretical interest not because it represents the 
intermolecular potential of any known substance, rather because any calculations based on 
the potential are simple. 

Alder and Wainwright (1957) introduced the modelling technique now known as molecu¬ 
lar dynamics to the world in a short article in 1957. They reported a study of hard discs, 
the two-dimensional equivalent of hard spheres. 


10.3.2 Finite Square Well 

A subsequent paper by Alder and Wainwright (1959) is usually regarded as the keynote 
paper in the field, and I quote the abstract: 

A method is outlined by which it is possible to calculate exactly the behaviour of several 
hundred interacting classical particles. The study of this many-body problem is carried out 
by an electronic computer that solves numerically the simultaneous equations of motion. 
The limitations of this numerical scheme are enumerated and the important steps in making 
the program efficient on computers are indicated. The applicability of this method to the 
solution of many problems in both equilibrium and nonequilibrium statistical thermodynamics 
is discussed. 


In this second paper they chose a three-dimensional system of particles and the finite 
square well potential shown in Figure 10.5. This potential is especially simple because a 
given particle does not experience any change in velocity except when it is separated from 
another particle by <x 2 (when it undergoes an attractive collision) or <r, (when it undergoes 
a repulsive collision). 

On collision, the velocities are adjusted and the calculation restarts. Statistical data is 
collected every collision. 

In their dynamical calculation, all the particles were given initial velocities and positions. 
Typically, equal kinetic energies with the three direction cosines of the velocity vector were 
chosen at random and the initial positions corresponding to a face-centred cubic lattice. 
Once the initial configuration was set up, they calculated exactly the time at which the first 
collision occurs. The collision time can be found by evaluating for every pair in the system, 
the time taken for the projected paths to reach a separation of or o 2 . 
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Figure 10.5 Finite square well 


If two particles A and B have initial positions r A0 and r B 0 and velocities v A and v B then 
the instantaneous positions at time t will be 


r A = r A , 0 + u A t 
r B = r Bj0 + u B r 


giving 


r A — r B = r Aj0 - r Bj0 + (u A - u B ) t 


and so 

(r A - r B ) 2 = (r A , 0 - r B , 0 ) 2 + 2 1 (r A , 0 - r B , 0 ). (u A - u B ) + t 2 (u A - u B ) 2 
If we rewrite the last equation as a quadratic in t as 


where a takes values 1 or 2, we see that the time required for a repulsive or attractive 
collision is 




In order to find the first collision time, all pairs have to be analysed. All the particles are 
then allowed to move for such time, and the velocities of the colliding pair are adjusted 
according to the equations of motion. 

The finite square well occupies an important place in the history of molecular modelling. 
Real atomic and molecular systems have much more complicated mutual potential energy 
functions, but the finite square well does at least show a minimum. On the other hand, 
because of the finite square well potential, the equations of motion are particularly simple 
and no complicated numerical techniques are needed. There are no accelerations until two 
particles collide. 
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10.3.3 Lennar djonesium 

The first simulation of a ‘real’ chemical system was Rahman’s (1964) study of liquid 
argon. He studied a system comprising 864 Lennard-Jones (L-J) particles under conditions 
appropriate to liquid argon at 84.4K and a density of 1.374gem -3 . Once again, there is 
much to be gained by studying the abstract, so here is the first part of it. 

A system of 864 particles interacting with a Lennard-Jones potential and obeying classical 
equations of motion has been studied on a digital computer (CDC 3600) to simulate molecular 
dynamics in liquid argon at 94.4 K and a density of 1.374 g cm -3 . The pair correlation function 
and the constant of self-diffusion are found to agree well with experiment; the latter is 15% 
lower than the experimental value. The spectrum of the velocity autocorrelation function shows 
a broad maximum in the frequency range co — 0.25(2 7ik B T/h). The shape of the Van Hove 
function G s (r, t) attains a maximum departure from a Gaussian at about t = 0.3 x 10 -2 s and 
becomes a Gaussian again at about 10 -u s... 

There are several interrelated problems. A sample size has to be chosen; this is usually 
determined by the available computer resource and the complexity of the potential function, 
because the potential function has to be calculated very many times during the simulation. 
The number of particles and the density determine the size of the container. At the same 
time we need to decide on a potential function; the natural choice for the inert gases is the 
L-J potential, and we should note that the L-J potential is essentially short range. 

So many early papers used the L-J potential that the noun Lennardjonesium was coined 
to describe a nonexistent element whose atoms interacted via the L-J potential. 


10.4 Algorithms for Time Dependence 


Once we have calculated the potential and hence the force by differentiation, we have to 
solve Newton’s equation of motion. If F A is the force on particle A, whose position vector 
is r A and whose mass is m A then 

Fa = (10.5) 

= m A a A 


This is a second-order differential equation that I can write equivalently as two first-order 
differential equations for the particle position r A and the velocity v A 


( 10 . 6 ) 


10.4.1 Leapfrog Algorithm 


A simple algorithm for integration of these two equations numerically in small time steps 
At can be found by considering the Taylor expansion for v(t): 


v *(' + y) =I * w+ (yr),y 

-KH«-(£),y 



(10.7) 
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Subtracting and rearranging we get 

V A (t + y^ = V A (t - y ^ + a A (0 At H- (10.8) 

I will switch back and forth between (for example) v and dr/df in order to try and improve 
the readability of the equations. Also, I could have written v A (t) or (v A ), to mean the 
instantaneous velocity of particle A at time t. The acceleration a is calculated from the force. 
Using the same procedure for the Taylor expansion of r A at the time point t + Af/2 we get 

r A (t + A t) — r A (t) + v A (t + y^ At + ■ ■ ■ (10.9) 

Equations (10.8) and (10.9) form the so-called leapfrog algorithm, which is reputed to be 
one of the most accurate and stable techniques for use in molecular dynamics. 

A suitable time increment At for molecular dynamics is a femtosecond (10 -15 s). In the 
leapfrog scheme, the velocities are first calculated at time t + At/2. These are used to 
calculate the positions of the particles at time f + At and so on. In this way the velocities 
leap over the positions and then the positions leap over the velocities. 


10.4.2 Verlet Algorithm 


If instead we start from the Taylor expansion of r A (f) we have 

r, (t + At) = r A (0 + A/ + (A0 1 + ■ ■ ■ 

r. (, - A0 = r A (0 - (^f)_ A, + \ (^r)_ <A0 2 + - • 
which gives (assuming that third-order and higher terms are negligible) 
r A (t + At) = 2r A (t) - r A (t - At) + ( ^ ) (At) 2 


This is known as the Verlet algorithm. The acceleration is obtained from the force exper¬ 
ienced by atom A at time t. The velocity does not appear in the expression, but it can be 
obtained from the finite difference formula 

A (t + At) - r A (f - At) 


A (t) = 


2A t 


( 10 . 12 ) 


The Verlet algorithm uses positions and accelerations at time t and the position at time t —At 
to calculate a new position at time t + At. All these have to be stored at every iteration. 

A variant is the velocity Verlet algorithm, which requires only the storage of positions, 
velocities and accelerations that all correspond to the same time step. It takes the form 


r A (, + A,) = r A «) + (^)_ A, + (At) 1 


There are many other algorithms in the literature, each with their own strengths and 
weaknesses. 
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10.5 Molten Salts 

Molten salts (such as the alkali halides) are of great technological interest in the field of 
metal extraction. The first simulations were done in by Woodcock (1971). Molten salts bring 
a new problem because the potential energy terms are long range. Consider a (hypothetical) 
one-dimensional infinite crystal, part of which is shown in Figure 10.6. 


+Q -Q 

•omomomom 


Figure 10.6 Part of a one-dimensional crystal; separation between ions is a 


The unshaded ions have charge —Q, the shaded ions have charge +Q and the spacing 
between the ions is a. Suppose we start with the central (grey) ion at infinity, and all the 
other ions at their lattice positions as shown. The work done W in bringing the grey ion 
from infinity to its place in the lattice is 


and the term in brackets converges very slowly to its limiting value of ln(2): 


2Q 2 


/ 1 1 1 

( 1_ 2 + 3“4 


No. of terms Sum 

1 1.0000 

2 0.5000 

3 0.8333 

4 0.5833 

100 0.6882 


Such series have to be summed when calculating the force on a given ion in a periodic 
box; in the case of neutral species, the sum is very quickly convergent because of the short- 
range nature of the forces. For neutral systems, a cut-off radius is usually taken beyond 
which the interactions are set to zero. 


10.6 Liquid Water 

Water plays a prominent place amongst solvents for obvious reasons. Two-thirds of our 
planet is covered in water, chemical reactions tend to be done in aqueous solution, and 
so on. The relatively strong hydrogen bonding in liquid water causes many of its physical 
properties to be ‘anomalous’, and the structure of ice has long interested both theoreticians 
and experimentalists. 

Neutron diffraction studies on heavy ice, solid D 2 0, have shown that water molecules 
retain their identity in condensed phases with very little distortion of their molecular geo¬ 
metry. This means that water molecules may be treated as rigid asymmetric rotors (with six 
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degrees of freedom) rather than explicitly treating the three nuclei separately (nine degrees 
of freedom). The classical energy for a collection of N rigid rotor molecules consists of the 
kinetic energy for translation and rotation, together with the intermolecular potential. Each 
water molecule is described by six coordinates: three specify the centre of mass and three 
angles that fix the spatial orientation about the centre of mass. In this section, I will denote 
these coordinates by the six-dimensional vector X. In terms of the linear velocities v,-, the 
angular velocity vectors the moments of inertia I, and the coordinates X,, the energy 
turns out to be 



e = 


The question then is the extent to which the intermolecular potential is pair wise additive; 
such functions may always be resolved into contributions from pairs, triples and higher 
contributions: 


U (X lt X 2 ,... X N ) = U (2) (X„ Xj) + U ° } ( X ” X i’ X*) + • • • + U m (X 1; X 2 , ...X N ) 

(10.14) 


In the case of simple fluids such as liquid argon it is widely believed that U is practically pair 
wise additive. In other words, the contributions for p > 2 are negligible. However, the 

local structure in liquid water is thought to depend on just these higher order contributions 
and so it is unrealistic in principle to terminate the expansion with the pair contributions. 
It is legitimate to write the potential as a sum of pair potentials, provided one understands 
that they are effective pair potentials that somehow take higher terms into account. 

The classic molecular dynamics study of liquid water is that of Rahman and Stillinger 
(1971). They wrote the effective pair potential as a sum of two contributions: a L-J 12-6 
potential {/l_j for the two oxygen atoms 



and a function U A , modulated by a function S{R L] ) that depends sensitively on the molecular 
orientations about the oxygen atoms 


t/ (2) (X„ Xj) = U u (/?„) + S (/?,,) U A (X„ X,) 


(10.15) 


The L-J parameters were taken to be those appropriate to neon, er= 282 pm and 
e = 5.01 x 10 -22 J, on the grounds that neon is isoelectronic with water. 

Four point charges Q each of magnitude 0.19e and each 100 pm from the oxygen nucleus 
are embedded in the water molecule in order to give l/ d . Two charges are positive, to 
simulate the hydrogen atoms, whilst the other two are negative and simulate the lone pairs. 
The four charges are arranged tetrahedrally about the oxygen. 

The set of 16 Coulomb interactions between two water molecules gives U A . Figure 10.7 
shows the minimum energy conformation, with the two ‘molecules’ 76 pm apart (i.e. an 
oxygen-oxygen distance of 276 pm). 
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Figure 10.7 Two water molecules 


The so-called switching function prevents the charges overlapping, and takes values 
between 0 and 1 depending on the distance between the oxygen atoms. 

The authors took /V = 216 water molecules in a cube of side 1862 pm, which cor¬ 
responds to a density of lgcm -3 . They took periodic boundary conditions in order to 
eliminate undesirable wall effects, and a cut-off of distance of 916.5 pm beyond which the 
intermolecular interactions were set to zero. 

The first step in a molecular dynamics calculation is to generate a suitable initial state. 
In general, random structures created using density values consistent with experiment may 
give atoms that are very close together. This causes undesirable large repulsive forces 
in small localized regions of space. One approach is to start from a low-density system 
with the atoms or molecules placed on simple cell lattices. The density is then increased 
gradually. 

Starting velocities then have to be assigned. One technique is to calculate a value of the 
speed from equipartition of energy, and then assign random directions to each particle, with 
the constraint that the system has zero overall momentum. 

The next step is to let the system progress for a sufficient number of time steps in order 
to eliminate any undesirable effects due to the choice of initial conditions. 

After that, collection of statistics begins. The temperature is inferred from the average 
values of the translational and rotational kinetic energies, both of which should equal \ Nk B T. 

It is possible in principle to follow a molecular dynamics simulation by displaying 
the atom coordinates at a series of time points. It proves more convenient to calculate 
certain statistical quantities. I mentioned the pair correlation function g <2) (r) earlier; in 
water the three distinct types of nuclear pairs lead to three corresponding functions g 0 o (2> ( r ), 
g 0 H (2) W and gHH <2, (r), which give information about the fraction of time that differential 
volume elements separated by distance r simultaneously contain pairs of the nuclei given 
by the subscripts. These functions all have a limiting value of 1 as r —» oo. The goo <2) (/) 
curve is shown schematically in Figure 10.8. The horizontal axis is actually a reduced 
distance r/o. 

Two important features of the curve are as follows. 

1. The narrow first peak at 275 pm corresponds to 5.5 neighbouring oxygen atoms. 

2. The second peak is low and broad, with a maximum at about 465 pm. The ratio of the 

second peak to the first, 465 pm/275 pm = 1.69 is close to that observed for an ideal ice 

structure = 1.633). 

The mean square displacement is often calculated. This indicates the average displace¬ 
ment of an atom during a fixed time period. 
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Figure 10.8 0-0 pair distribution function 


10.6.1 Other Water Potentials 


Over the years, a number of authors have tackled the problem of finding a suitable pair 
potential for liquid water. All involve a rigid water monomer with a number of interaction 
sites. The original TIPS 3-site model proposed by Jorgensen (1981) has positive charges on 
the hydrogens and a negative charge (q 0 = —2q H ) on the oxygen. The interaction potential 
is taken as the sum of all intermolecular Coulomb terms together with a single L-J 12-6 
term between the oxygens: 


— 1 y^ ^ 

- 47T£o Rij + Roo R 6 oo 


(10.16) 


The parameters q t , A and B were chosen to give reasonable energetic results for gas-phase 
complexes of water and alcohols. 

The author subsequently reoptimized the parameters to give TIP3P, an improved three- 
site model. 

Four-site models have also been used. The oldest is that due to Bernal and Fowler (1933), 
with a more modem version referred to as TIP4P by Jorgensen et al. (1983). In this case, 
the negative charge is moved off the oxygen and towards the hydrogens at a point along the 
bisector of the HOH angle. The pair potential is still calculated according to the equation 
above but more distances have to be evaluated. 

Stillinger and Rahman’s ST2 potential is widely used; this is an improved version of 
the one discussed above, but adds a L-J 12-6 term between the oxygen atoms. In all, 17 
distances have to be evaluated. 


10.7 Different Types of Molecular Dynamics 

When Newton’s equations of motion are integrated, the energy is conserved. Since the 
energy and temperature are related by the equipartition of energy principle, the temperature 
should also be constant. Slow temperature drifts do occur as a result of the numerical 
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integration and also because of the truncation of the forces. Several methods for performing 
molecular dynamics at constant temperature have been proposed, of which the simplest is 
known as rescaling. If the current temperature at time t is T(t) then for a system of N 
particles and 3N degrees of freedom we have 



To adjust the temperature to exactly a reference temperature T rcf we simply rescale the 
velocities by a factor 



Scaling the velocities at appropriate intervals can therefore control the temperature of the 
system. We speak about constant-temperature molecular dynamics. 

Nose (1984) proposed an alternative method that adds an extra degree of freedom referred 
to as a heat bath to the atomic degrees of freedom. Extra kinetic energy and potential energy 
terms are added to the total energy, and the molecular dynamics is carried out with this one 
extra degree of freedom. Energy can flow back and forth between the heat bath and the 
system, and an equation of motion for the extra degree of freedom is solved. 

Similar considerations apply to the pressure; in order to control the pressure in a system, 
any change in volume must be monitored and adjusted. Methods are available, and such 
calculations are referred to as constant-pressure molecular dynamics. 


10.8 Uses in Conformational Studies 

Molecular Dynamics is generally used to study the structure and behaviour of materials, 
where one is concerned with the intermolecular potential. It is of course also perfectly 
possible to study single molecules and it is sometimes used as an aid in conformational 
analysis. As noted previously, there are very many local minima on the potential energy 
surface and it may be that small variations in torsional angles do not facilitate searching of 
all regions. 

The idea is to start with a MM local minimum, which corresponds to a temperature of 
0 K. We use molecular dynamics to raise the temperature, let the molecule vibrate for a 
while at an elevated temperature and cool it back to 0 K. Hopefully the molecule will then 
have crossed over any potential barriers and a reoptimization will give a new structure. 

To give a simple example, consider the straight-chain hydrocarbon C 2 oH 42 , shown in 
Figure 10.9. A MM+ optimization starting from the fully stretched structure gave the 
structure shown. There is restricted rotation about each C-C bond and in a later chapter I 
will show how theories of polymeric molecules focus attention on the end-to-end distance; 
the simplest theories predict relationships between this and the number of C-C links in 
such a molecule. C 20 H4 2 is barely a polymer, but the principles are the same. There are 
very many conformers, all similar in energy. I therefore set up a 15 ps molecular dynamics 
calculation, as shown in Figure 10.9. 
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Figure 10.9 Molecular dynamics calculation on a hydrocarbon 



Figure 10.10 End-to-end distance 
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The options are shown, and statistics were collected in an MS/EXCEL file. The 
end-to-end distance varied dramatically over the experiment, from its initial value of 
2435 pm through 850 pm. The average end-to-end distance was 1546 pm, as shown in 
Figures 10.10 and 10.11. 



Figure 10.11 C 20 hydrocarbon at end of molecular dynamics experiment 


The resulting structure was then optimized. One strategy in such calculations is 
to take snapshots of the structure at regular time intervals and then optimize from 
each snapshot. 
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11 

Introduction to Quantum Modelling 


By the early days of the twentieth century, scientists had successfully developed three of the 
four great cornerstones of physics: Sir Isaac Newton’s mechanics, James Clerk Maxwell’s 
electromagnetic theory and Albert Einstein’s theory of special relativity. They had a picture 
of the physical world where matter was made from point particles and radiation consisted of 
electromagnetic waves, and this picture seemed to explain all known physical phenomena 
with just a few untidy exceptions. 

These untidy exceptions comprised phenomena such as the theory of black body radi¬ 
ation, the photoelectric effect, Compton scattering, atomic structure and spectra and a few 
other apparently unrelated experimental findings. I do not have space in this text to go into 
the historical detail; I will simply say that a thorough study of such phenomena led to the 
fourth cornerstone, quantum theory. Every textbook that has to deal with quantum theory 
has its own particular treatment and starting point, and this one is no exception; I am going 
to assume that you have heard of Erwin Schrodinger and his famous equations, and start 
the discussion at that point. Perhaps I can reassure you by saying that most professional 
scientists perceive quantum theory as a hard subject (along with electromagnetism). Even 
Schrodinger did not fully understand the physical meaning that we now attach to his wave- 
functions when he first wrote down his famous equation and solved it for the hydrogen 
atom. 


11.1 The Schrodinger Equation 

Consider then the simple case of a particle of mass m constrained to the x axis by a potential 
U(x, t ) that depends on x and time 1.1 have allowed the potential to be time dependent in 
order to cover the possibility of (for example) an electron being influenced by an external 
electromagnetic wave. Schrodinger’s time-dependent equation states that 
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■ 

\ 87 x 2 m dx 2 


where j is the square root of — 1 (J 2 = — 1). P (x, t) is the wavefunction and in general it is 
a complex valued quantity. The square of the modulus of P can be found from 

\P (x, t) I 2 = P* (x, t ) P (x, t) 


where P* is the complex conjugate of P. The fact that complex wavefunctions are 
allowed and that the Schrodinger equation mixes up real and imaginary quantities caused 
Schrodinger and his contemporaries serious difficulties in trying to give a physical meaning 
to P. According to Max Bom, the physical interpretation attaching to the wavefunction 
P (x, t) is that P*(x, t)P(x, t )dx gives the probability of finding the particle between x and 
x + Ax at time t. It is therefore the square of the modulus of the wavefunction rather than 
the wavefunction itself that is related to a physical measurement. 

Since P*(x, t)P(x, t)dx is the probability of finding the particle (at time t ) between x 
and x + dx, it follows that the probability of finding the particle between any two values a 
and b is found by summing the probabilities 


P = 


J V* (x, t) V (x, t) dx 


In three dimensions the equation is more complicated because both the potential U and the 
wavefunction P might depend on the three dimensions x, y and z (written as vector r) as 
well as time: 


/_ if /jP _9^_\ 

V 8n 2 m \3x 2 3 y 2 3 z 2 ) 


U(V,i)\P(V,t)=j- 


( 11 . 2 ) 


You are probably aware that the operator in the outer left-hand bracket is the Hamiltonian 
operator and that we normally write the equation 


HP (r, t) =j~ -^ 

2tx d, 

The Bom interpretation is then extended as follows: 


P* (r, t) P (r, t ) dxdydz 


(11.4) 


which gives the probability that the particle will be found at time t in the volume element 
dxdydz. Such volume elements are often written dr. 

I have not made any mention of time-dependent potentials in this text, but they certainly 
exist. You will have met examples if you have studied electromagnetism where the retarded 
potentials account for the generation of electromagnetic radiation. 

For the applications we will meet in this book, the potentials are time independent; in 
such cases the wavefunction still depends on time, but in a straightforward way as I will 
now demonstrate. 
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11.2 The Time-Independent Schrodinger Equation 


I am going to use a standard technique from the theory of differential equations called 
separation of variables to simplify the time-dependent equation; I want to show you that 
when the potential does not depend on time, we can ‘factor out’ the time dependence from 
the time-dependent Schrodinger equation and just concentrate on the spatial problem. I will 
therefore investigate the possibility that we can write 'P (r, t) as a product of functions of 
the spatial (r) and time (f) variables, given that the potential only depends on r. We write 

P (r, t) = if (r) T (t) (11.5) 


and substitute into the time-dependent Schrodinger equation to get 


(-«( 




Notice that the differential operators are partial ones and they only operate on functions 
that contain them. So we rearrange the above equation 

/ h 2 / 3 2 3 2 d 2 \ \ h dT (t) 

Tl »(-** w ■ + w + *0 ■ + ' Vlr - V * (r) mJ * (r) s -ir 

divide by P (r, t) and simplify to get 


i / y / 8 2 jP d*\ 

f (r) \ 87r 2 m \ dx 2 dy 2 dz 2 ) 


. 1 h dT(t) 

(r, t) ) i/r (r) = j - 

^ ’ 1 v w J T(t)2jt d t 


The left hand side of the equation depends on the spatial variable r and the time variable t. 
Suppose now that the potential depends only on r and not on t. That is to say 

Under these circumstances, the left-hand side of the equation depends only on r and the 
right-hand side only on t. The spatial and the time variables are completely independent 
and so the right-hand side and the left-hand side must both equal a constant. In the theory 
of differential equations, this constant is called the separation constant. In this case it is 
equal to the total energy so I will write it as e: 

1 / h 2 ( d 2 d 2 d 2 \ \ 

?w( _ iA;(»? + v + w) T (r)= ‘ 


J T (f) 2 n d t 

The second equation can be solved to give 

T(t) = A exp 


(_2nf 

V h 


2njEt\ 


and so the full solution to the time-dependent equation in the special case that the potential 
is time independent can be written 

W (r, t) = f (r) exp 


( 11 . 8 ) 
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Probabilities then become time independent since 

** (r, t) * (r, 0 = r (r) exp ( * (r) exp 
= r (r) if (r) 


11.3 Particles in Potential Wells 

I have mentioned a number of model potentials in earlier chapters when dealing with 
molecular dynamics and Monte Carlo. A model potential of historical interest in quantum 
mechanical studies is the ‘infinite well’. 


11.3.1 One-Dimensional Infinite Well 

A particle of mass m is constrained to a region 0 < x < L by a potential that is infinite 
everywhere apart from this region, where it is equal to a constant U 0 (Figure 11.1). This 
might be a very crude model for the conjugated electrons in a polyene, or for the conduction 
electrons in a metallic conductor. (In fact there is a technical problem with this particular 
model potential caused by the infinities, as I will explain shortly.) 


Figure 11.1 One-dimensional infinite well 


To solve the problem, we examine the Schrodinger equation in each of the three 
regions and then match up the wavefunction at the boundaries. The wavefunction must 
be continuous at any boundary. 

In regions I and III the potential is infinite and so the wavefunction must be zero, in order 
to keep the Schrodinger equation finite. In region II the potential is a finite constant U 0 and 
we have 

( h 2 d 2 \ 
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which can be rewritten 




(11.9) 


The standard solution to this second-order differential equation is 


\js = A sin 


'8jr 2 m (e — U 0 ) 

h 2 



where A and B are constants of integration. We know that is = 0 in regions I and III so the 
wavefunction must also be zero at x = 0 and at x = L in order to be continuous at these 
points. Substituting i/r = 0 at x = 0 shows that B = 0. If we substitute i// = 0 at x = L 
we get 



This means that either A — 0 or the argument of sin must be an integral multiple of it 
(written ri). A cannot be 0, for otherwise the wavefunction would be 0 everywhere, and so, 
on rearranging 



( 11 . 10 ) 


The energy is quantized, and so I have added a subscript to the e for emphasis. It generally 
happens that the imposition of boundary conditions leads to energy quantization. It is often 
necessary when tackling such problems to match up both the wavefunction and its gradient 
at the boundaries, but in this particular case the solution appeared just by considering the 
wavefunction. (In fact, the infinities in this particular potential mean that the first derivative 
is not continuous at the boundaries; ‘real’ potentials do not contain such sudden jumps to 
infinity.) 

Parameter n is the quantum number and at first sight the allowed values are n = ... — 2, 
— 1,0,1,2,... The value n = 0 is not allowed for the same reason that A cannot be 0. We 
can deduce the constant of integration A by recourse to the Born interpretation of quantum 
mechanics; the probability of finding the particle between x and x + dx is 


fl (x) dx 


Probabilities have to sum to 1 and so we require 


J fl (x) dx = 1 


from which we deduce 
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We notice also that the values of ±n give wavefunctions that differ only by multiplication 
by -1, since 


sin (-"]r) = - sin Gr) 


According to the Bom interpretation, both give the same probability density and so we 
only need one set. It is conventional to take the quantum numbers as positive and the value 
of A as the positive square root so the results can be collected together as 


[2 /rmx\ 

• = Vz sm (-r) 


Plots of the squares of the wavefunctions for n m 1 and n = 3 are shown in Figure 11.2. 
For a particle with n = 1 there is zero probability of finding the particle at the end points 
and a maximum probability of finding the particle at the centre point. For n = 3 there are 
three equal maxima of probability. 



11.4 Correspondence Principle 

In 1923, Neils Bohr formulated a principle that he had already used in his treatment of the 
hydrogen atom, called the correspondence principle', this states that quantum and classical 
predictions have to agree with each other in the limit, as the quantum number(s) get large. 
In other words, the results of classical physics may be considered to be the limiting cases of 
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quantum mechanical calculations with large quantum numbers. A plot of i/f 3 2 0 (Figure 11.3) 
shows the operation of the correspondence principle; the probabilities appear much more 
evenly spread out over the region. In the case of a ‘classical’ particle in such a well, we 
would expect an equal probability of finding the particle at any point along the line. 



0 0.2 0.4 0.6 0.8 1 

x/L 


Figure 11.3 Square of wavefunction n = 30 


11.5 Two-Dimensional Infinite Well 

In the two-dimensional version of this problem, we consider a single particle of mass m 
constrained to a region of the xy plane by a potential that is infinite everywhere except for 
a region where 0<x<oo, 0<y<oo where it is equal to a constant U 0 , as shown in 
Figure 11.4.1 have taken the region to be square, but this is not necessary. 


U infinite 

y=L 
0 

x=0 x=L 



Figure 11.4 Two-dimensional infinite well 
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The (time-independent) Schrodinger equation is 

g + = 0 (11.12) 

dx- 3 y 1 h- 

and the wavefunction now depends onx andy. Straightforward application of the ‘separation 
of variables’ technique discussed above gives the following wavefunctions and energies: 


n 2 h 2 k 2 h 2 

£ - k = Uo+ ^L 2 + SM 2 



n,k= 1,2,3,... 


There are now two quantum numbers that I have decided to call n and k. Each quantum 
number can equal a positive integer. The wavefunction is a product of sine functions, and 



Figure 11.5 Square of wavefunction, contour view 
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we now have to think about visualization. Two popular ways to visualize such objects are as 
contour diagrams or as surface plots. The two representative Figures 11.5 and 11.6 refer to 
the solution n = 4 and k = 2, and in keeping with the importance of the Bom interpretation, 
I have plotted the squares of the wavefunctions. The horizontal axis is x, and the vertical 
axis is y. 



Figure 11.6 Two-dimensional box, surface plot 


11.6 Three-Dimensional Infinite Well 

It should be obvious by now how we proceed to three dimensions. The relevant Schrodinger 
equation is 

JT + JT + JT + — * = 0 (1U4) 

dx 2 3 y 2 dz 2 h 2 

which can be solved by the standard techniques discussed above. Application of the bound¬ 
ary conditions gives three quantum numbers that I will write n, k and l. The solutions 
are 


k 2 h 2 

8mL 2 


Ph 2 

8mL z 




(11.15) 


i,k,l= 1,2,3,... 
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Figure 11.7 summarizes the one-dimensional and the cubic three-dimensional infinite 
well problems. The energies are plotted in units of h 2 /SmL 2 . The one-dimensional energy 
levels are shown as the left-hand ‘column’ of the figure. They simply increase as n 2 . 
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Figure 11.7 One-dimensional and cubic three-dimensional wells 


The three-dimensional problem shows a much more interesting pattern. Firstly, many of 
the energy levels are degenerate (that is to say, there are many quantum states with the same 
energy). Some of the degeneracies are natural , for example ^1,2,3. ^1,3,2, ^2,1,3, ^2,3,1. Vriu 
and Vf 3j 2,1 all have energy \4h 2 /(8mL 2 ). There are also many accidental degeneracies; for 
example there are 9 degenerate quantum states with energy 41/z 2 /(8wL 2 ) because l 2 +2 2 -|-6 2 
and 4 2 + 4 2 + 3 2 are both equal to 41. Secondly, as the energy increases, the energy levels 
crowd together and the number of quantum states of near equal energy is seen to increase. 
This crowding together does not happen in the one-dimensional case. 

The square two-dimensional case lies somewhere in between, with fewer degeneracies. 

The degeneracies do not occur if we take arbitrary sides for the two- and three- 
dimensional regions. Nevertheless, their quantum states still crowd together as the quantum 
numbers increase. 
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11.7 Two Noninteracting Particles 

Consider now the case of two noninteracting particles A and B in a one-dimensional 
well. Writing x A and x B for their position coordinates along the x axis, the time- 
independent Schrodinger equation is (taking the constant of integration to be zero, for 
simplicity) 

(ITT + Sj] * (*a,*b) = E* {x A ,x B ) (11.16) 

67T z m \OX A OX B ) 

I have written capital and E to emphasize that the equation refers to a system of two 
particles, and I will use this convention from now on. The equation looks like a candidate for 
separation of variables, so we investigate the conditions under which the total wavefunction 
can be written as a product of one-particle wavefunctions 


^ (*a.*b) = Va (Xa) iAb (*b) 


Substitution and rearrangement gives 

h 2 1 d 2 V^a h 2 1 d 2 V^B 

8n 2 m \js A dx\ 8 jr 2 m \Jr B dx B 


Applying the now familiar argument, we equate each term on the left-hand side to a (dif¬ 
ferent) separation constant e A and e B , which have to sum to E. The total wavefunction is 
therefore a product of wavefunctions for the individual particles, and the total energy is the 
sum of the one-particle energies. There are two quantum numbers that I will call n A and n n 
and we find 



n A ,n B = 1,2,3,... 


We often summarize the results of such calculations, where the solution turns out to 
be a product of individual one-particle solutions, by a simple extension of Figure 11.7. 
Figure 11.8 shows one possible solution, where particle A has quantum number 2 and 
particle B has quantum number 1. There is more to this simple problem than meets the 
eye; you might be wondering how it is possible to ‘label’ one particle A and one B. If A 
and B had been two motorcars travelling along a motorway then the labelling could be 
done by the vehicle number plate. But how can we possibly label electrons, or any other 
atomic particles for that matter? This is not an easy question; I will come back to it later 
in the book. 
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Figure 11.8 Two particles in a one-dimensional box 


11.8 Finite Well 


The infinite well occupies an important place in the history of molecular quantum mechan¬ 
ics, and we will meet it again in Chapter 12. We discussed the finite well in Chapters 9 and 
10, but from the viewpoint of classical mechanics. The finite well potential gives a rather 
crude model for a diatomic molecule, and it is shown again in Figure 11.9. It keeps an 
infinite potential for the close approach of the two nuclei, but allows for a ‘valence region’ 
where the potential is constant. 

The potential is infinite for — oo < x < 0, zero (or a constant) for the valence region 
where 0 < x < L and a constant D for L < x < oo. As usual, we divide up the x axis into 
three regions and then match up i jr at the two boundaries. It is also necessary to match up 
the first derivative of \jr at the x = L boundary. 

In region I, the potential is infinite and so i/r = 0. This means that i/r = 0 at the boundary 
when x = 0. 

In region II, the potential is constant (zero) and so we have 


k 2 

87r 2 m 


This can be solved to give 
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Figure 11.9 The finite box 


where A and B are constants of integration. We can eliminate the B term straight away by 
the argument of Section 11.3. 

In region III we have 

(~^^ +D )' ,rW * MrW 


For now, let us interest ourselves only in those solutions where the energy is less than or 
equal to D. Such solutions are called bound states. We will shortly see that solutions exist 
where the energy is greater than D and these are called unbound states. I can rewrite the 
Schrodinger equation for region III as 


d 2 f 

~dx 2 


h 2 

8n 2 m 


(D-s) f = 0 


The standard solution to this second-order differential equation is 
\[r = E exp 



where E and F are constants of integration. 

We now have to apply boundary conditions for the bound state solutions where e is 
less than D. If we want the wavefunction to be zero at infinity, then E must be zero. The 
wavefunction and its first derivative must also be equal at the boundary x = L. Thus we have 
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Fexp(— 

Dividing one equation by the other and rearranging gives a formula involving the 
energy e: 



The energy is quantized and the precise values of e depend on D and L. 

11.9 Unbound States 

I mentioned in the discussion above that we would be concerned only with solutions where 
the energy e was less than the well depth D. Such states are called bound states, by which 
one means that the particle is somehow localized to the potential well in the sense that the 
probability of finding the particle between x and x+dx should decrease the further we move 
away from the potential well (you probably noticed that I assumed the wavefunction should 
fall off exponentially, and so I deliberately excluded the possibility that the probability could 
get larger with distance). 

Depending on L and D, a one-dimensional finite well can have one, many or an infinite 
number of bound state solutions. Finite wells can also have unbound solutions that are 
perfectly acceptable in so far as both i/r and its first derivative are finite and continuous for 
all values of x, but they do not necessarily have the property that they tend to zero for large 
values of x. 

Taken individually these solutions cannot represent the state of a particle because they 
do not vanish for large values of x. This causes a real problem with the Bom interpretation 
because they cannot be made to satisfy the normalization condition. The solution to this 
problem is to combine waves together into wave packets, and this topic is discussed in more 
advanced textbooks. 



11.10 Free Particles 


The simplest example of unbound states is afforded by studying free particles; consider 
again the one-dimensional problem of a particle (of mass m) and a potential that is a constant 
which I will take to be zero. The time-independent Schrodinger equation is 
/ h 2 d 2 \ 

(-8^d^j^ W =^ W 


which has solutions that I can write 


h 


\jr (x) = A exp (jkx ) where k = ±- 
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Here k can take any value; the energy e is positive, but now it has a continuum of possible 
values. The time-dependent solution is therefore 

>P (x, t) = A exp (jkx) exp 


(-T) 


which can be written 


'P (x, t) — A exp (j (kx - cot)) 

where 


2 TVS 

CO= ~hT 

This is a classic one-dimensional wave equation, and it corresponds to a de Broglie wave. 
In three dimensions we write 


(r, t) = A exp (/' (k.r — cot)) 


(11.19) 


where k is the wavevector. You will have encountered the wavevector if you have studied 
electromagnetism beyond elementary level. 

We notice then that for each value of the energy, there are two possible wavefunctions 


iff (x) = A exp 


i /f (x) = A exp 



( 11 . 20 ) 


That is, there is a double degeneracy. A feature of the free particle wavefunctions is that 
they cannot be made to satisfy the Bom normalization condition, because there is no value 
of A that will satisfy the ‘equation’ 

|A| 2 / \fi(x)\ 2 dx = 1 

This is not a trivial problem, and workers in the field often interpret solutions of the form 
of Equation (11.20) as an infinite beam of noninteracting particles moving to the right or 
left (according to the + or — sign) with a constant number of particles per length. 


11.11 Vibrational Motion 


I gave you the classical treatment of a diatomic molecule in Chapters 4 and 5, where I 
assumed that the relevant potential was Hooke’s law. I demonstrated that the vibrations of 
a diatomic molecule (atomic masses m, and m 2 ) were exactly equivalent to the vibrations 
of a single atom of reduced mass /z where 


If I write the ‘spring’ extension R - R t as x, then the vibrational Schrodinger equation 
can be written 
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d 2 i jr 
to 2 " 


87T 2 yU . 

h 2 


( s - 


= 0 


( 11 . 21 ) 


where k s is the force constant. The energy e and wavefunction i/r now describe the nuclear 
vibrations. This is a more difficult differential equation that the ones discussed earlier in this 
chapter, but fortunately it is well known in many areas of engineering, physics and applied 
mathematics. Detailed solution of the vibrational Schrodinger equation is given in all the 
advanced texts and to cut a long story short we interest ourselves in bound states and so 
impose a condition i/r —> Oasx —» ±oo. This gives rise to energy quantization described by 
a single quantum number that is written v. The general formula for the vibrational energies is 


'• = SVf( V+ 0 (,1.22) 

v = 0,1,2,... 


The harmonic oscillator is not allowed to have zero vibrational energy. The smallest 
allowed energy is when v = 0, and this is referred to as the zero point energy. This finding 
is in agreement with Heisenberg’s uncertainty principle, for if the oscillating particle had 
zero energy it would have also zero momentum and would be located exactly at the position 
of the minimum in the potential energy curve. 

The normalized vibrational wavefunctions are given by the general expression 

( _ v 1/2 

// V (?)exp(-^Q (11.23) 


2ns/k s p 




The polynomials H v are called the Hermite polynomials ; they are solutions of the second- 
order differential equation 

0-2*f+2v* = 0 (11.24) 

and are most easily found from the recursion formula 


H v+l (?) = 2?fl v (?) - 2vH v _ l (?) 


The first few are shown in Table 11.1. 

Squares of the vibrational wavefunctions for v = 0 and v = 5 (with numerical values 
appropriate to 14 N 2 ) are shown as Figures 11.10 and 11.11. 


Table 11.1 The first few Hermite polynomials /-?(?) 


V 

H v (?) 

0 

1 

1 

2 ? 

2 

4? 2 — 2 

3 

8? 3 — 12 ? 

4 

16? 4 — 48? 2 + 12 

5 

32? 5 — 160? 3 + 120 ? 
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Variable 

Figure 11.10 Square of vibrational wavefunction v = 0 for dinitrogen 



Variable 


Figure 11.11 Square of vibrational wavefunction v — 5 for dinitrogen 
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1.4 -10 s 



-10 -5 0 5 10 


Variable 


Figure 11.12 Square of vibrational wavefunction v = 30 for dinitrogen 


The classical model predicts that the maximum of probability is when the particle is at the 
ends of its vibrations (when it is stationary). The quantum mechanical model seems quite 
different, since the v = 0 solution has a probability maximum half way along the bond. 
The operation of the correspondence principle should be clear once we consider higher 
quantum numbers. Even by v = 5 it is becoming clear that the maxima are moving toward 
the ends of the vibration. Examination of v = 30 as shown in Figure 11.12 confirms the 
trend, which in any case can be deduced analytically. 











12 

Quantum Gases 


In Chapter 11,1 discussed the quantum mechanical description of a particle in a selection 
of potential wells, including the three-dimensional infinite cubic box of side L. For a single 
particle of mass m, I showed that the energy is quantized and that each quantum state is 
described by three quantum numbers n, k and l as follows: 


n 2 h 2 k 2 h 2 l 2 h 2 



n,k,l = 1,2,3,... 


We often set the constant of integration U 0 to zero, without any loss of generality. We will 
make this simplification for the rest of the chapter. 

Consider now an ideal gas comprising N such particles, constrained to the cubic three- 
dimensional infinite potential well. Ideal gas particles do not interact with each other and 
the total wavefunction is therefore a product of one-particle wavefunctions (orbitals), as 
discussed in Chapter 11. Also, the total energy is the sum of the one-particle energies. 

Suppose we arrange that the system is thermally isolated and then make a measurement 
on the system to find how the energy is distributed amongst the particles at (say) 300 K. 
We will find some particles with low quantum numbers and some with high. The kinetic 
energy of particle i is given by 


Si = (n 2 + k 2 + l 2 ) 


h 2 

%mU 


and the total kinetic energy is therefore 


£kta - s m r 2 H H 
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This can be identified with the thermodynamic internal energy C th 




(12.3) 


We do not yet know how the kinetic energy will be divided out amongst the particles, all we 
know is that the summation will be a complicated function of the quantum numbers. For 
the sake of argument, call the summation A. Since the box is a cube of side L, the volume 
V of the box is L 3 and so we can write 

C/th = —AV~ 1/3 (12.4) 

8 m 

We know from elementary chemical thermodynamics that for a thermally isolated system 


P = ~ 


and so the particle in a box model gives the following relationship between pressure and 
volume: 


which can be rearranged together with the expression for U lh to give 


(12.5) 


PV=\u m 


( 12 . 6 ) 


That is to say, the pressure times the volume is a constant at a given temperature. Not only 
that, but if the internal energy of the sample (of amount n ) is given by the equipartition of 
energy expression \nRT, we recover the ideal gas law. 


12.1 Sharing Out the Energy 

Statistical thermodynamics teaches that the properties of a system in equilibrium can be 
determined in principle by counting the number of states accessible to the system under 
different conditions of temperature and pressure. There is as ever some interesting small 
print to this procedure, which I will illustrate by considering three collections of different 
particles in the same cubic infinite potential well. First of all, I consider distinguishable 
particles like the ones discussed above that have mass and quantized energy. I raised the 
question in Chapter 11 as to whether or not we could truly distinguish one particle from 
another, and there is a further problem in that some particles have an intrinsic property 
called spin just as they have a mass and a charge. 

We will then extend the ideas to photons (which have momentum but no mass), and 
finally consider the case where the particles are electrons in a metal. 

We noted in Chapter 11 two differences between a one-dimensional and a three- 
dimensional infinite well: firstly, the three-dimensional well has degeneracies both natural 
and accidental; secondly, the three-dimensional quantum states tend to crowd together as 
the quantum numbers increase. If we consider a box of 1 dm 3 containing (for example) 20 Ne 
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atoms at 300 K, then each atom will have \k R T kinetic energy, according to the equiparti- 
tion of energy principle. A short calculation suggests that we will find typically quantum 
numbers as high as 10 9 ; energy is certainly quantized, but the separation between the energy 
levels is minute in comparison to k B T. Under these circumstances, it is possible to treat 
the energy levels and the quantum numbers as if they were continuous rather than discrete. 
This is called the continuum approximation. 

Direct calculation shows how the number of quantum states lying in a given small interval 
increases with energy (Table 12.1). 


Table 12.1 Properties of quantum states. Here E = h 2 /8mL 2 


Number of quantum states with energy lying 
between 

(100 and 110) £ 

85 

(1000 and 1010) £ 

246 

(10000 and 10010) £ 

1029 

(100 000 and 100 010) £ 

2925 

(1 000 000 and 1 000 010) £ 

8820 


What matters in many experiments is not the precise details of an energy level diagram, 
rather a quantity called the density of states Die) defined such that the number of states 
between e and e + de is D(e)ds. 

Figure 12.1 shows a plot of the number of states versus the square root of the energy, 
and it seems that there is a very good linear relationship. 



Square root of energy 

Figure 12.1 Square root relationship 


I want now to establish that this is indeed the general case, and I can do this using a 
process called Rayleigh counting. 
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12.2 Rayleigh Counting 

First of all I plot out the quantum numbers n, k and l along the x, y and z Cartesian axes as 
shown in Figure 12.2. 



For large quantum numbers the points will be very close together and there will be 
very many possible combinations of the three particular quantum numbers n, k and l that 
correspond to the same energy 


According to Pythagoras’ theorem, each combination will lie on the surface of a sphere of 
radius r where 


r 2 = n 2 + k 2 + l 2 


We therefore draw the positive octant of a sphere of radius r, as shown. The volume of the 
octant also contains all other combinations of the three quantum numbers that have energy 
less than or equal to the value in question. We only need consider this octant, since all 
quantum numbers have to be positive. The number of states with energy less than or equal 
to e is 


1 4tt 
8 ~3~ 


(n 2 + k 2 + l 2 ) 


3/2 


or in terms of the energy 


1 47T 8 mL 2 

8 ~3 If' 


I now draw a second octant of radius r + dr, shown in Figure 12.3, which contains all 
quantum states having energy less than or equal to e + de. The volume of the outer sphere is 


1 4jt Sml'} 

8~3 W 


(e + de) 3/2 
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Figure 12.3 Rayleigh counting continued 


The number of states having energy between e and e + de is given by the volume 
contained between the two spheres 


1 An 8mL 2 ,,, 1 47r 8 mL 2 

8 3 h 2 K ’ 8 3 h 2 


,3/2 


(12.7) 

which is by definition D(e)Ae. Use of the binomial theorem on Equation (12.7) gives 
8 mL 2 

■'‘de 

( 12 . 8 ) 


2nmL 2 

~ m £ 


l/2 de 


I will write the result as 

D (e) = Be' 11 


(12.9) 


12.3 Maxwell-Boltzmann Distribution of Atomic Kinetic Energies 


It is possible to perform experiments that measure the kinetic energies of atoms in gaseous 
samples. Once again we note that there are so many particles present that the quantities of 
interest are the mean and standard deviation, rather than the properties of every individual 
particle. We normally study variation of a quantity 


/ = 


1 AN 
N~de 


( 12 . 10 ) 


with energy £ at a fixed thermodynamic temperature. Here, N is the number of atoms in 
the sample. The related quantity 

AN 
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is referred to as the distribution function of the kinetic energy. When the experiments are 
carried out, we find a characteristic spread of kinetic energies as shown in Figure 12.4. The 
distribution is independent of the type of atom, the solid curve refers to a temperature of 
500 K and the dashed curve refers to a temperature of 1500 K. The peak moves to higher 
energy as the temperature increases, but the area under the curve remains constant. 



Energy/J 

Figure 12.4 Distribution of atomic kinetic energies 


The quantity g(e ) gives the number of atoms whose energies lie between e and e + de. 
The number of atoms whose energies lie between e A and e B is therefore 

J g (e) de 

and the total number of atoms is given by 

N = J g (e) de 

I can give a very simple explanation of Figure 12.4, on the assumption that the sample of 
particles is constrained to a cubic three-dimensional infinite well. My explanation depends 
on the continuum approximation and so will only work for ordinary temperatures, for at 
low temperatures the spacing between quantum states will be comparable to k B T and so the 
continuum approximation will not apply. Also, it will not work for anything but an atom 
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since molecules have rotational and vibrational energies in addition to translational kinetic 
energy. 

Apart from the density of states mentioned above, D{s) = Be 1/2 , we have to consider 
the average occupancy of each quantum state. This is given by the Boltzmann factor 



where A is a constant and N the number of atoms. Combining the two expressions we get 



We can eliminate the constants A and B in terms of N by use of the equations above and 
we find after rearrangement 



( 12 . 11 ) 


which gives exact agreement with the experimental curves. It can be established by dif¬ 
ferentiation of Equation (12.11) that the peak occurs at e = \k B T, and it can also be 
established by integration that the average energy per particle < £ > is \k B T, in accord 
with the equipartition of energy principle. 


12.4 Black Body Radiation 

A study of black body radiation was a milestone in the path to our modern theory of quantum 
mechanics. The term black body radiation refers to the electromagnetic radiation emitted 
by a ‘perfect emitter’ (usually a heated cavity). 

The curves in Figure 12.5 relate to the energy U emitted by a black body of volume V 
per wavelength X at temperatures of 1000 K (bottom curve), 1500 K (middle curve) and 
2000 K (top curve). The quantity plotted on the y axis is 


1 dU 
V dl 


and it is observed that: 

• each of the curves has a peak at a certain maximum wavelength A. max ; 

• A max moves to shorter wavelength as the temperature increases. 

These curves have a quite different functional form to the atomic kinetic energy curves 
shown in Figure 12.4. Max Planck studied the problem and was able to deduce the following 
expression that gives a very close fit to the experimental data: 



( 12 . 12 ) 


For present purposes, it proves profitable to think of the problem in the following way. 
We know that the heated cavity contains photons, and that the energy of a photon of 
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Figure 12.5 Black body radiation 


wavelength A. is hc 0 /X. We also know that d U = ed N = sg(s)ds. The experiment actually 
gives a distribution function g(e) such that the number of photons with energies between e 
and e + de is g(e)ds. Equation (12.12) can be developed to give 


8(b) = 


8nV , 1 



(12.13) 


If we interpret this expression as 

g(s) = (density of states) x (average occupancy of a state) 

we see that the density of states for photons is proportional to e 2 whilst the average 
occupancy of a state is proportional to 



rather than the Boltzmann factor. This factor is referred to as the Bose factor after S.N. 
Bose. 

I can explain the density of states quite easily; photons have zero mass and so the energy 
formula (Equation (12.1)) for a particle in a three-dimensional infinite well 
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cannot apply. The correct formula turns out to be 


(12.14) 


There are still three quantum numbers, but the formula does not include a mass. As the 
energy increases, the quantum states become numerous and cram together. We can evaluate 
the density of states just as before by taking a narrow range of energies e and e + de, and 
counting the number of quantum states. We find 

D (e) oc e 2 

that is to say, the photon states crowd together more quickly than for a classical particle in 
such a well. A detailed calculation gives the proportionality constant 


(12.15) 


where V is the volume of the container. This explains the density of states, but not why the 
Bose factor is different from the Boltzmann factor. 


12.5 Modelling Metals 

We usually divide solids into conductors (materials like copper that conduct electricity), 
insulators (materials like glass that do not conduct electricity) and semiconductors that 
have properties in between. We visualize metallic conductors as comprising a rigid array 
of cations surrounded by a sea of mobile electrons, as shown in Figure 12.6. 



Figure 12.6 Simple model of a metallic conductor 
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12.5.1 The Drude Model 

The first attempt to model metals was the free electron model developed independently 
by P. Drude and by Lorentz. According to this model, each metallic atom in the lattice 
donates its valence electrons to form a sea of electrons. So a sodium atom would lose one 
electron and an aluminium atom three; these conduction electrons move freely about the 
cation lattice points, rather like the atoms in an ideal gas. 

The model had some early successes; electrical conduction can be explained because the 
application of an electric field to the metal will cause the conduction electrons to move from 
positions of high to low electrical potential and so an electric current can be established. In 
fact. Ohm’s law can be recovered from the Drude model with a little analysis. 

Eventually, the Drude model gradually fell into disrepute. An obvious failing was its 
inability to explain the heat capacity of metals. If the model of solid aluminium is correct 
and each A1 atom loses three valence electrons, we might expect that these valence electrons 
would make a contribution to the internal energy of 3 x (3/2 )RT, with a corresponding molar 
heat capacity of (9/2 )R (i.e. 32 J K -1 mol -1 ). In fact, A1 obeys the Dulong-Petit rule with a 
molar heat capacity of about 25 J K -1 mol -1 , so the equipartition of energy principle seems 
to overestimate the contribution from the conduction electrons. 

The electrons are of course in motion, and we should ask about their energy distribution 
just as we did in earlier sections. There is no experimental technique that can measure the 
spread with the same accuracy, as can be done with photons in the black body experiment, or 
with particles in the kinetic theory experiments. A variation on the theme of the photoelectric 
effect (called high-energy photoemission) can however be used to probe electron kinetic 
energies in a metal. If I write the familiar expression for the distribution function g(e) 


g(s) = (density of states, D(s)) x (average occupancy of a state, p(e)) 


then it turns out that the density of states is proportional to e l/2 , just as it was for the spread 
of kinetic energies in an ideal gas. The average occupancy factor is quite different from 
anything we have met so far: 


p(e) - 


Parameter e F is called the Fermi energy, and I have illustrated the factor for the case of a 
Fermi energy of 2.5 eV at 600 and 6000 K in Figure 12.7. The lower temperature shows a 
factor of 1 for all energies until we reach the Fermi level, when the factor becomes zero. 
The lower the temperature, the sharper is the cut-off. The corresponding Boltzmann factors 
are shown in Figure 12.8, from which it is seen that the two sets of results are radically 
different. 

The experimental distribution function is therefore 


g(e) = Be 1/2 - 


/ (e — e F ) \ , 


(12.16) 


and the average occupancy of states factor cannot be explained by the Drude model. 
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Figure 12.8 Corresponding Boltzmann factors 

On the other hand, it could be argued that the ideal gas model is inappropriate because 
charged particles exert tremendous long-range forces on each other. Not only that, there 
are frequent collisions between all the charged particles. However, analysis shows that the 
problem lies with the classical Drude treatment, not with the ideal gas model. 
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12.5.2 The Pauli Treatment 

It was W. Pauli who devised a quantum mechanical treatment of the electron gas; in Pauli’s 
model, the conduction electrons are still treated as an ideal gas of noninteracting particles, 
but the analysis is done according to the rules of quantum mechanics. My starting point is 
the energy level diagram for a cubic three-dimensional infinite potential well (Figure 12.9). 



Pauli assumed that the metal could be modelled as an infinite potential well, and that the 
electrons did not interact with each other. He also assumed that each quantum state could 
hold no more than two electrons. At 0 K, the N conduction electrons fill up the N/2 lowest 
energy quantum states. I have shown 8 conduction electrons in Figure 12.9, for illustration. 
The highest occupied quantum state corresponds to the Fermi level, and the Fermi factors 
are therefore 


P 0) = 


1 for s < e F 
0 for e > £ f 


At higher temperatures, the electrons will be spread amongst the available quantum states, 
but subject still to the restriction that each quantum state can hold a maximum of two 
electrons. For electron energies much above the Fermi level so that e — e F k R T, the 
Fermi factor can be approximated by ignoring the 1 in the nominator to give 

P W-exp(-^) 
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which is very similar to the Boltzmann factor. For sufficiently large electron energies, the 
Fermi factor reduces to the Boltzmann factor. For electron energies below the Fermi level, 
the factor can be written 

,,(E) " 1 -“ p (xr) 

which simply tells us that nearly all the states are fully occupied. 

We have therefore reached a point where three different types of particles in the same 
three-dimensional infinite potential well show very different behaviour (Table 12.2). 


Table 12.2 Density of states and average occupancies 


Particle 

Density of states 
proportional to 

Average occupancy 
proportional to 

'Classical' ideal gas 

e m 


Photons in cavity 

e 2 

eXP fe ) -1 

Electrons in metal 

g 1/2 

e *p(,Tfr) + ' 


12.6 Indistinguishability 


Let me now return to the infinite one-dimensional well, with just two noninteracting particles 
present. I labelled the particles A and B, and showed in Chapter 11 that the solutions were 


£„a,„ b 


2 . /n A ttx a \ . / n B jrx B \ 
L Sm \ L ) Sm \ L ) 
/ 2 2 \ h 1 

("a + "b) 8 tnL 2 


n A ,« B = 1,2,3,... 


(12.17) 


Suppose one particle has a quantum number of 1, whilst the other has a quantum number 
of 2. There are two possibilities 



(12.18) 
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and they are degenerate (they have the same energy). In such situations, it turns out that 
any linear combination of degenerate solutions is also an acceptable solution. This is easily 
seen by rewriting the problem in Hamiltonian operator form 


= EV 1,2 

H<E 2?1 = E*P 2 ,i 

H (a ^ lj2 + bV v ) = E (a'P h2 + b^i) 


In the above equation, a and b are nonzero scalars that could be complex. 

We live in a world of mass production where items that come off a production line appear 
to be identical. A shopping trip to the supermarket might yield four tins of dog food, all with 
the same label, the same contents and the same price, but on close examination it usually 
turns out that there are differences. One tin might have a scratch, one might be slightly 
dented, the label might be peeling from the third and so on. In the everyday world, objects 
that seem at first sight to be identical may very well turn out to be distinguishable. Even 
if the cans were exactly identical, we could label them A, B, C and D with an indelible 
marker and so make them distinguishable. 

Things are very different in the world of atoms and molecules. A visit to MolMart® 
for four hydrogen atoms would reveal that all hydrogen atoms are exactly the same; there 
is no question of one atom having a scratch or a dent, and it is certainly not possible to 
label a hydrogen atom with an indelible marker. We say that the four hydrogen atoms are 
indistinguishable. This simple observation has far-reaching consequences. 

I mentioned in earlier sections that wavefunctions themselves do not have any phys¬ 
ical interpretation, but that the modulus squared of a wavefunction is an experimentally 
observable probability density. So if i/r(x) describes a single particle, then 


W (x)\ 2 dx 


gives the chance of finding the particle between x and x + dx and this can be measured in 
experiments such as X-ray and neutron diffraction. 

The wavefunctions and above involve the coordinates of two particles and in 
this case the physical interpretation is that, for example, 


1^ (x A ,x B )\ 2 dx A dx B 


represents the probability of simultaneously finding particle A between x A and x A + dx A 
with particle B between x B and x B + dv B • If we now add the condition that the particles are 
truly indistinguishable then we should get exactly the same probability on renaming the 
particles B and A. So if we consider (</fi i2 ) 2 



it is clear that the probability density treats the two particles on a different footing and so 
implies that they are distinguishable. The acid test is to write down the probability density 
and then interchange the names of the particles. The probability density should stay the 
same. 

The two wavefunctions <ffi 2 and do not satisfy this requirement and so they are not 
consistent with our ideas about indistinguishability. I can illustrate the problem by plotting 
the squares of the wavefunctions as density maps, as in Figures 12.10 and 12.11. 
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The plots are of the square of the wavefunction (the ‘out-of-plane’ axis) versus the x 
coordinates of the two electrons along the in-plane axes. The diagrams imply quite different 
probability densities for an arbitrary choice of the pair of coordinates; the probability 
contours ought to be symmetrical about their diagonals. The problem can be removed if we 
make use of the degeneracy mentioned above and construct a wavefunction that is either 
the sum or the difference of the product functions with x A and x B interchanged. Each of 
these two combinations gives a probability density that is unaffected by an interchange of 
the names of the two particles. Allowing for the correct normalization, these combinations 
are the following. 

Symmetric: 


Antisymmetric: 


*1 (xa,x b ) 


J^(«%(x a ,Xb) + ^2,, 
(iP u (x A ,x B ) - 


(Xa,*b)) 


C*a,*b)) 


(12.19) 


(X A ,X B ) 


( 12 . 20 ) 
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The labels symmetric and antisymmetric arise from the fact that interchange of the particle 
names leaves the first combination unchanged but leads to a reversal of sign in the second 
combination. 

The resulting probability maps are shown in Figures 12.12 and 12.13. 

In the symmetric state the maximum probability occurs at the points for which x A = 
x B = L/4 or 3L/4 and quite generally the probability is large only if the difference between 
the particles is small. 

The antisymmetric state is quite different; the probability is zero everywhere along the 
diagonal (which corresponds to x A = x B ) and the probability maxima occur at positions 
where the two particles are far apart. I must emphasize that this apparent attraction and 
repulsion between particles has nothing whatever to do with the forces between them; 
indeed, we assumed at the start that the particles did not interact with each other. It certainly 
is not an additional force in nature; it is simply to do with the indistinguishability of particles 
and the symmetry of the wavefunction describing them. Like the zero-point vibrational 
energy, this is a purely quantum mechanical effect and has no ‘explanation’ in classical 
physics. 
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12.7 Spin 

It turns out as an experimental fact that many particles have an intrinsic property called spin 
just as they have a mass, a charge and so on. If the particles concerned are electrons, then 
we refer to the phenomenon as electron spin. Electron spin is expressed in two possible 
configurations usually referred to as spin up and spin down with respect to a given axis 
of quantization. I will have more to say about electron spin in a later chapter; it does not 
appear from the Schrodinger treatment, just accept it as an experimental fact for the minute. 

The two states are written conventionally a and |3 and authors often use the symbol 
s for the spin variable. So in the case of electrons, we might write a(.s A ) to mean that 
particle A was in the spin up state and P(s A ) to mean that particle A was in the spin down 
state. In the case of two electrons we have four possibilities and we label the quantum 
states individually a(s A ), P(.v A ), ct(.s B ) and (3(.v B )■ The combination a(s A )(3(s B ) describes 
a situation where electron A has spin up and electron B has spin down and there are four 
possible combination states 


t(i A )a(i B ); a(s A )P(,s B ); P(.y A )a(.v B ); f3(.y A )P(.v B ) 
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The first and the last are automatically symmetric with respect to interchange of the two 
electron names, but the middle two are not satisfactory because they imply that the two 
electrons can somehow be treated differently. Bearing the discussion of Section 12.6 in 
mind, we construct two other spin functions that have definite spin symmetry properties. 
Symmetric: 


Antisymmetric: 


£ 


a Oa) a Ob) 

(a Oa) P Ob) + P Oa) a Ob)) 
P Oa) P Ob) 


£ 


(a Oa) P Ob) — P Oa) a Ob)) 


If we had been considering two spinless particles, for example two alpha particles, there 
would be no spin states and the above complications would not have arisen. We now have 
to combine together the space and the spin wavefunctions. We do this by multiplying 
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together a space part and a spin part, and it should be clear that there are two possible 
results (Table 12.3) and to decide between them we have to resort to experiment. For 
many-electron systems, the only states that are ever observed are those that correspond to 
antisymmetric total wavefunctions. Heisenberg (1926) recognized this fact and this led to 
a detailed understanding of the electronic spectrum and structure of the helium atom. The 
condition of antisymmetry eliminates half of the possible states that would otherwise be 
possible. 


Table 12.3 Combining space and spin functions 


Space part 

Spin part 

Overall wavefunction 

Symmetric 

Symmetric 

Symmetric 

Antisymmetric 

Symmetric 

Antisymmetric 

Symmetric 

Antisymmetric 

Antisymmetric 

Antisymmetric 

Antisymmetric 

Symmetric 


(When considering more than two particles, we have to consider all possible permutations 
amongst the space and spin functions.) 


12.8 Fermions and Bosons 

Not all wavefunctions for identical particles are antisymmetric, but detailed studies show 
that, for a given kind of particle, states of only one overall symmetry exist; the states are 
either all symmetric or all antisymmetric with respect to interchange of particle names. 
Particles whose overall wavefunction is antisymmetric are cal led /ermz'wi.y(after E. Fermi), 
and particles whose overall wavefunction is symmetric are called bosons (after S.N. Bose). 
Typical fermions and bosons are shown in Table 12.4. 


Table 12.4 Fermions and bosons 


Fermion 

Boson 

Electron 

Photon 

Proton 

Deuteron 

Neutron 

Alpha particle 

3 He atom 

4 He atom 


12.9 Pauli Exclusion Principle 

The famous Pauli exclusion principle is a statement of our discussion above; in its general 
form it says that fermion wavefunctions must be antisymmetric to exchange of the names 
of two fermions, whilst boson wavefunctions must be symmetric. The principle is stated in 
different ways in different books. For example, the world’s most popular physical chemistry 
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text (Atkins 2006) says: ‘No more than two electrons may occupy any orbital, and if two 
do occupy it their spin directions must be opposite.’ 

In principle we must invoke the symmetry or antisymmetry of a wavefunction when deal¬ 
ing with systems of two or more identical particles. For example, if we wish to describe two 
ground-state hydrogen atoms, the total wavefunction must be antisymmetric to exchange 
of the names of the two electrons, even if an infinite distance separates the two atoms. 
Common sense suggests that this is not necessary, and the criterion is to ask by how much 
the two wavefunctions overlap each other. If the overlap is appreciable, then we must worry 
about the exclusion principle. If the overlap is negligible, we can disregard the symmetry 
or otherwise of the total wavefunction. 


12.10 Boltzmann’s Counting Rule 

We must now return to the three probabilities discussed in Section 12.5. Consider a simple 
case of three particles A, B and C, each of which has a different energy. 

Figure 12.14 shows three of the six possible arrangements of the three distinguishable 
particles, and, according to the Boltzmann counting procedure, we take account of all six 
and they are all weighted equally when we calculate the number of ways W. 
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Figure 12.14 Boltzmann's counting method 


This procedure is not consistent with the principles of quantum mechanics discussed 
above. We certainly cannot label particles and make them distinguishable, and it is not 
meaningful to enquire which particle is in one particular state. All we can do is to say how 
many particles are in each state, and instead of six possibilities there is just one (with one 
particle in each state). 

We then have to consider whether the particles are fermions or bosons; if they are fermions 
with spin 1 / 2 (like electrons), then each quantum state can hold no more than two particles, 
one of a and one of (3 spin. If they are bosons, then the exclusion principle does not apply 
and all the particles can crowd together in any one state, should they so wish. This is the 
basis of the three different probabilities discussed in Section 12.5. 

That leaves just one loose end: if all particles are either fermions or bosons, why does 
the Boltzmann distribution work so well in explaining the properties of ‘ordinary’ gases? 
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The overwhelming experimental evidence is that the particles in an ordinary gas can be 
treated as if they were distinguishable particles, subject to the Boltzmann probability law. 
The answer is quite simply that under many normal circumstances the quantum states are 
very sparsely occupied and so fermions do not need to be aware of the exclusion principle, 
whilst bosons cannot display their ability to all crowd together into the lowest state. 
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13 

One-Electron Atoms 


The hydrogen atom occupies an important place in the history of molecular modelling and 
quantum theory. It is the simplest possible atom, with superficially simple properties, yet 
it was a thorn in the side of late nineteenth and early twentieth century science. Rutherford 
had discovered the nucleus by 1911, and had a rough idea of its size from his scattering 
studies and so people pictured the hydrogen atom as a negative charge (the electron) making 
a circular orbit with constant angular velocity round the nucleus (a positive charge). This 
is shown in Figure 13.1. 



Figure 13.1 A one-electron atom 


I should mention a small technical point: if we want to eliminate the overall translational 
motion of the atom from our discussion, the atomic system, nucleus plus electron, actually 
rotates round the centre of gravity. This rotational motion of the electron (mass m e ) and the 
nucleus (mass M ) is equivalent to rotation of a single particle with reduced mass /i about 
the origin. Here 

1 _ i 1 
/z ~~ M + rn e 
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On account of its circular motion, the electron is accelerating. According to Maxwell’s 
classical electromagnetic theory, accelerating electric charges emit energy as electromag¬ 
netic radiation and so the atom ought not to be stable, since it should lose all its energy as it 
orbits the nucleus and emits radiation. Also, the frequency of this emitted radiation should 
equal the frequency of revolution. 


13.1 Atomic Spectra 


The problems were that atoms are certainly stable, and that a hydrogen atom can emit or 
absorb electromagnetic radiation having only certain specific wavelengths. For example, 
Balmer had discovered in 1885 that a series of five spectral lines in the visible region with 
wavelengths A = 656,486,434,410 and 397 nm fitted the formula 


(13.1) 


It was eventually realized that the reciprocal of the wavelength was the more fundamental 
quantity and that the equation was better written 


1 

Xn 


= r h 


(H) 


(13.2) 


(where R H is the Rydberg constant for hydrogen). Similar series of spectral lines were 
observed for He + where the proportionality constant is (very nearly) \R\\, for Li 2+ (with a 
proportionality constant of almost ~i? H ) and so on. The important point is that the value of 
1/A. for every spectral line is a difference between two terms that we now know correspond 
directly to particular energy levels in the atom. 


13.1.1 Bohr’s Theory 

Neils Bohr is reputed to have said that once he saw the Balmer formula, the whole problem 
of the hydrogen atom became clear to him. In any event, in 1913 he was able to devise 
a spectacularly successful theory that could account for the spectral lines. He equated the 
centrifugal force for circular motion with the attractive electrostatic force between the 
nucleus (charge Ze) and the electron (charge —e): 


Ze 2 /iv 2 

47T£ 0 r 2 r 


(13.3) 


This gives two unknowns, r and the speed v. He assumed that the orbital angular momentum 
was quantized in units of h/ltt: 




h 

n lit 


(13.4) 


where the quantum number n could take values 1, 2, 3, . . . This assumption is exactly 
equivalent to requiring that the circumference of the orbit is just the right value to per¬ 
mit the electron to behave as a standing wave (i.e. a de Broglie wave). In other words, 
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the circumference has to be an integral number n of de Broglie wavelengths, that is 
to say 

2 Ttr = n- (13.5) 

P 

where p is the momentum (= /xv). Simultaneous solution of Equations (13.3) and (13.4) 
gives allowed values for the radius and the speed 


_ e 0 h 2 n 2 
7r/zZe 2 


Ze 2 

2e 0 hn 


(13.6) 


and also quantized energies given by 


jtiZV 1 
Wei 


(13.7) 


Bohr referred to these orbits as stationary states because according to his theory the electron 
did not radiate when it was in any one such state. When the electron passed from one state 
(the initial state) to another (the final state), radiation was emitted or absorbed according 
to the Planck-Einstein formula 

hen 

Ifif - £il = (13.8) 

Bohr’s theory predicted the existence of many series of spectral lines for the hydrogen atom 
(for which Z = 1) whose wavelengths fitted the generic formula 



n { = 1,2,3,... 

«i > «f 


The Balmer series corresponds to n f = 2, and the series corresponding to n t = 3 had already 
been observed in the infrared region of the electromagnetic spectrum by Paschen. Soon 
after Bohr published his theory, Lyman identified the series with n f = 1 in the ultraviolet. 
The n t — 4 (Brackett) and n f = 5 (Pfund) series were also identified. Bohr’s theory also 
predicted a first ionization energy (given by R K hc 0 , approximately 2.16806 x 10 -18 J or 
13.5 eV) that agreed well with experiment, and a physically reasonable value (52.9 pm) for 
the radius of the lowest energy orbit. 

Many series with high values of n have since been observed. For example, radio astro¬ 
nomers are very interested in the n, = 167 to n t = 166 emission which occurs at a wavelength 
of 21.04 cm. 

Bohr’s theory gives the following expression for R H , in agreement with experiment: 
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13.2 Correspondence Principle 


Bohr made much use of the correspondence principle discussed in Chapter 11, which says 
that quantum mechanical results must tend to those obtained from classical physics, in the 
limit of large quantum numbers. For example, if we consider the transition from level n to 
level n — 1, the emitted frequency is 

_ /re 4 Z 2 / 1 1 \ 

%elh? \ (n — l) 2 n 2 ) 


For large n this approximates to 


/re 4 Z 2 2 
8e 2 h 2 


(13.9) 


According to Maxwell’s electromagnetic theory, an electron moving in a circular orbit 
should emit radiation with a frequency equal to its frequency of revolution v/2itr. Using 
the Bohr expressions for v and r we deduce an expression in exact agreement with the 
frequency given by Equation (13.9). 

Despite its early successes, Bohr’s theory had many failings. For example, it could not 
explain the structure and properties of a helium atom. Many ingenious attempts were made 
to improve the model, for example by permitting elliptic orbits rather than circular ones, but 
the theory has gradually faded into history. It is sometimes referred to as the old quantum 
theory. 


13.3 Infinite Nucleus Approximation 


I started the chapter by correctly considering the motion of a one-electron atom about the 
centre of mass, and pointed out that this was equivalent to the motion of a single particle 
of reduced mass 

1 _ 1 1 

p ~~ M + m. 

Because the mass of the nucleus is so much greater than that of the electron, the reduced 
mass of the atom is almost equal to (and slightly less than) the mass of the electron. For 
that reason, workers in the field often treat the nucleus as the centre of coordinates and 
the electron as rotating round the nucleus, which is taken to have infinite mass; this is 
called the infinite nucleus approximation. It is just a small correction, the two masses p 
and m e are equal to 1 part in 10 4 which is usually good enough, but as we have seen, atomic 
spectroscopic data are known to incredible accuracy and it is sometimes necessary to take 
account of this difference. We write, for an infinite mass nucleus, 




where a 0 is called the first Bohr radius and R„c 
Rcc = 10 973 731.5685458 m- 1 . 


e 0 h 2 


(13.10) 


0 7tm e e 2 

is ‘the’ Rydberg constant, with value 
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13.4 Hartree’s Atomic Units 

One problem we encounter in dealing with atomic and molecular properties is that large 
powers of 10 invariably appear in equations, together with complicated combinations of 
physical constants such as those in Equations (13.10). For simplicity, we tend to work 
in a system of units called the Hartree units, or atomic units (we will soon come across 
the two Hartrees in Chapter 14). The ‘atomic unit of length’ is the bohr and is equal to 
a 0 above. The ‘atomic unit of energy’ is the hartree and is equal to 2R x hc 0 . It is written 
E h and it is also equal to the mutual potential energy of a pair of electrons separated by 
distance a 0 : 

E " = r-- 

47re 0 «o 

Other atomic units can be defined, as shown in Table 13.1. 


Table 13.1 Hartree's atomic units 


Length 

Mass 

Energy 

Charge 

Electric dipole moment 
Electric quadrupole moment 
Electric field 
Electric field gradient 
Magnetic induction 
Electric dipole polarizability 
Magnetizability 


Q 

p, ea 0 

9 e eafg 

E E h e~'ag' 

~V m E h e-'ag 2 

B (h/2n)e~% 

a e 2 a*E^ 

? e 2 a 2 m;' 


5.2918 x 10-" m 
9.1094 x KT 31 kg 
4.3598 x 10“ 18 J 
1.6022 x 10' 19 C 
8.4784 x 10' 30 C m 
4.4866 x 10“ 40 C m 2 
5.1422 x 10 11 V m -1 
9.7174 x 10 21 V m -2 
2.3 5 05 x 10 5 T 
1.6488 x 10“ 41 C 2 m 2 J-’ 
7.8910 x 10 -29 JT -2 



The Bom interpretation of quantum mechanics tells us that <P t (r)>P(r)dT represents the 
probabihty that we will find the particle whose spatial coordinate is r within the volume 
element dr. Probabilities are real numbers and so the dimensions of the wavefunction must 
be (length)- 3 / 2 . In the atomic system of units, the unit of wavefunction is therefore Og 3/2 
(or (bohr)" 3 / 2 ). 


13.5 Schrodinger Treatment of the Hydrogen Atom 

The wavefunction of a hydrogen atom </( ot depends on the coordinates of the electron x, y, 
z and the nucleus X, Y, Z. The time-independent Schrodinger equation is 

h 2 /a 2 */'™ 9 2 ^tot 9 2 ^tot\ h 2 (d 2 <P 9 2 ^ d 2 V\ 

~ 8 nbn, \ dx 2 + 9y 2 + 9z 2 j _ Sn 2 M ( 9X 2 + 9T 2 + 9Z 2 ") + ^ _ Stot ' I ' tot 


(13.11) 
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where m e is the electron mass, M the nuclear mass and U the mutual electrostatic 
potential energy of the nucleus and the electron, given by 


U- 


-Ze 2 1 

47re ° y/{x - Xf + (y - Y ) 2 + (z - Zf 


(13.12) 


I have temporarily added a subscript ‘tot’ to show that we are dealing with the total atom, 
nucleus plus electron, at this stage. The equation is not at all simple in this coordinate system 
and it proves profitable to make a change of variable. We take the coordinate origin as the 
centre of gravity of the atom (coordinates x c , y c , z. c ) and use spherical polar coordinates for 
the nucleus and the electron. If the spherical polar coordinates of the electron are a, 6, (f> 
and those of the nucleus b, 6, <p where 


M 



M + m t 


then, if yu. is the reduced mass 
m e M 

^ ~ m e + M 

X = x c — — r sin# cos 0 x = x c + — r sin 6 cos0 
M m e 

(i ii 

Y =y c -r sin# sin0 y = y c H- r sin# sin0 

M m e 

Z = Zq - r cos# z = Z c H- r cos# 

M m t 

With this change of variable we find 

h 2 ( 3 2 ^ tot 3 2 ^ tot 9 2 ^, ot \ 

8 n 2 (m e + M) \ dxl + dy% + dz 2 ) 

M a / 2 a^ tot \ i a / 1 _J 2 ^t 0 , \ 

8>tt 2 h \r 2 dr \ dr ) r 2 sin# 3# \ 3# / r 2 sin 2 # dtp 2 ) 


and also in this coordinate system 


(13.13) 


(13.14) 


= Etot^tot 

(13.15) 


4ne 0 r 

It is apparent that we can separate the wave equation into two equations, one referring to the 
centre of mass and involving the total mass of the atom, and the other containing r, 9 and 
0 We put = 0 e 0c and follow through the separation of variables argument to obtain 





One-Electron Atoms 197 


8n 2 (m e + M) 


V dx 2 dy 2 dz 2 c ) 


= (e,ot - e.) fic (13.16) 


h 2 


mm 


1 9 / 9lAe\ 1 

-:-( sin#—— ) +-, 

r 2 sin 9 d6 \ 89 J r 2 sin* 


0 


9Ve \ 

dr) 


+ ur = 

(13.17) 


The first equation relates to the translation of the atom as a whole, and I have dealt with such 
equations in earlier chapters. The second equation is usually called the electronic equation. 

It should be clear from this discussion that in the treatment of any atomic or molecular 
system the translational degree(s) of freedom may always be separated from the internal 
degrees of freedom and so need not be considered in general. Also, from now on, I will 
drop the subscript ‘e’ from the electronic equation. 

In the special case of a one-electron atom, the electronic wavefunction depends only 
on the coordinates of (the) one electron and so it is technically described as an atomic 
orbital. In view of the spherical symmetry, we might expect that it would prove profitable 
to consider a further separation of variables 


1r{r,d,<t>) = R(r)Y(e,4>) 


(13.18) 


This proves to be the case and R(r) is usually called the radial function (although I should 
warn you that many authors call P(r) = rR(r) the radial function; we will see why this 
proves a useful choice shortly). The 7’s turn out to be eigenfunctions of the orbital angular 
momentum, and again we will see why in due course. 

It turns out that most of the wavefunction solutions of Equation (13.18) are not spherically 
symmetrical, and you may find this odd given that the mutual potential energy U depends 
only on the scalar distance r (we call such fields central fields). The fact is that U(r) 
being spherically symmetrical does not imply that the solutions are also spherical; the 
gravitational field is also a central field, and you probably know that planetary motion is 
not spherically symmetrical. 


13.6 Radial Solutions 

Before launching into a discussion of the full solutions, I want to spend a little time consid¬ 
ering just those solutions that depend only on r. That is, those solutions that have a constant 
angular part; this is not the same thing as finding R(r) for the general case. I will also make 
the infinite nucleus approximation from this point on. 

A little rearrangement and manipulation gives, for functions that depend only on r, 


fi 2 d 2 (rR) Ze 2 
87r 2 m e dr 2 



(13.19) 


The form of this equation suggests that we make the substitution P = rR which explains 
why some authors focus on P{r) rather than R(r) and refer to P(r) as the radial function. 
This substitution gives 
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h 2 d 2 P 
~&jr 2 m c dr 2 ~ 


Ze 2 


-P = eP 


Ane 0 r 

It is usual in many scientific and engineering applications to try and simplify such com¬ 
plicated equations by use of ‘reduced variables’. In this case we divide each variable by its 
atomic unit, so for example 

7-red = r/d 0 

fired = S/E h 


d 2 


°dr 2 


and simplify to give 


-/ 2 red = fired/’red 


- -P = eP 


(13.20) 


1 d 2 P xti 

2 dr 2 d 

It gets a bit tedious writing the subscript ‘red’, so most authors choose to state their equations 
in dimensionless form such as 

1 d 2 P 
~2dr 2 " 

with the tacit understanding that the variables are reduced ones and therefore dimensionless. 
Sometimes authors say (incorrectly) that the equation is written ‘in atomic units’, and 
occasionally we come across statements of the impossible (e.g. m t = 1 and h/2n = 1). 

We therefore have to solve the differential Equation (13.20) and we follow standard 
mathematical practice by first examining the limiting behaviour for large r : 

d 2 P 

7T = ~ 2sP 
d r- 

If we limit the discussion to bound states (for which e is negative), we note that 

P(r)«exp(-0r) (13.21) 

where ft is a positive real constant. This suggests that we look for a wavefunction of the 
form 

P(r) = 0 ( r ) exp {-fir) (13.22) 


where <P(r) is a polynomial i 
Equation (13.20) then gives 

d 2 <P 


. Substitution of Equation (13.22) into the radial 


dr 2 


~ 2 P-a 


h (V + 2e + ^ 4 


Suppose for the sake of argument we choose 

<J> (r) = a 2 r 2 + dir + a 0 (13.24) 

where the coefficients a have to be determined. Substitution of Equation (13.24) into the 
differential Equation (13.23) gives 


2a 2 - 2p (2 a 2 r + a l ) + (p 2 + 2e + 




(a 2 r 2 + ai r + a 0 ) = 0 
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For this equation to hold for all values of the variable r, the coefficients of all powers of r 
must separately vanish; thus 

(p 2 + 2s) <*2 = 0 
(P 2 + 2s) <*i + 2 (Z — 2P) a 2 = 0 
(P 2 + 2s) <* 0 + 2 (Z - P)a, + 2<* 2 = 0 
2Z<* 0 = 0 


It follows that 



Z<*! + 2<*2 = 0 

<*0 = o 

The first two equations give an energy of —Z 2 /8 and because e is actually the energy divided 
by an atomic unit of energy E h we have 



The third and fourth equations determine all the coefficients but one, which we are free to 
choose. If we take <*i = 1 then <* 2 = —Z/2 and we have 

<*>(r)«f-|r 2 + r (13.25) 


13.7 Atomic Orbitals 

We must now examine the general case where the wavefunction depends on three variables. 
Substitution of 

rlr(r,6,4>)=R(r)Y(6,<l>) (13.26) 

into the electronic Schrodinger equation (Equation (13.17)) gives 

1 9 ( . dR\ 8n 2 m e , 19/ dY\ 1 d 2 Y 

Rdr\ dr) h 2 Tsin0 90\ 90/ Tsin 2 0 9</ 2 

(13.27) 

By our usual argument, both sides of this equation must be equal to a constant that we will 
call X. We then have two equations 
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1 d / 2 d R\ /8 7T 2 m e X\ 

7-ar{ r ^) + {-^ i£ - U) -^) R = 0 

sin0 90 \ 90/ sin 2 0 90 2 


I have discussed the second equation in Appendix A when considering angular momentum; 
the allowed solutions are the spherical harmonics Y tmt (0, <p), where X = 1(1+ 1) and l and 
m, are integers. Introducing the value for X into the first equation of (13.28) and expanding 
the first term gives 


d 2 R 2 d R 
dr 2 + r d7 


/ 8jr 2 m e 


(e-U)~ 


(13.29) 


or, in terms of P(r) introduced above (where P(r) = rR(r )) 


d 2 P 

dr 2 


(13.30) 


The term in 1(1+ 1) is called the centrifugal potential; it adds to the Coulomb term to give 
an effective potential. 

The radial equation (13.30) is more complicated than Equation (13.20) because of the 
1(1+ 1 )/r 2 term but in essence the same method of solution can be used. The details are 
given in standard traditional quantum chemistry texts such as Eyring, Walter and Kimball 
(EWK) (Eyring et al. 1944). The radial solutions are a set of functions from mathematical 
physics called the associated Laguerre polynomials (the Laguerre polynomial L a (x) of 
degree a in x is defined as 

d“ 

L a (x) = exp (x) — (x a exp (-*)) 

and the /0th derivative of L a (x) is called an associated Laguerre polynomial). 


13.7.1 l = 0 (s-orbitals) 

The atomic orbitals are named depending on the three quantum numbers. The first three 
s-orbitals are given in Table 13.2; they are conventionally written in terms of the variable 
p = Zr/a 0 (where Z = 1 for hydrogen). 


Table 13.2 The first few s-orbitals 


n, ?, m 

Symbol 

Normalized wavefunction 



1 /z \ 3t2 

1 , 0 , 0 

Is 

vsrUJ e * p( -'” 

2, 0, 0 

2s 


3, 0, 0 

3s 
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They are often presented as plots of \fr versus p or of An r 2 if/ 2 versus p. The latter is 
known as the radial distribution function and it gives the probability of finding the electron 
between two shells of radii r and r + dr surrounding the nucleus. Representative plots for 
hydrogen are shown in Figures 13.2 and 13.3. The dimension of ijr is (length) _3/2 and so 
the quantities plotted are respectively and a 0 {Anr 2 rj/ 2 ) against r/a 0 . 




Distance/aO 


Figure 13.3 Radial distribution curves for the hydrogen Is, 2s and 3s orbitals 


All the curves are asymptotic to the horizontal axis at infinity. The 2s curve crosses the 
axis once (it has a single radial node), the 3s curve crosses the axis twice and so on. There is 










202 Molecular Modelling for Beginners 


nothing else particularly remarkable about such plots of decreasing exponential functions. 
The radial distribution curves are a little more interesting. 

The radial distribution curve shows a maximum at the first Bohr radius for the Is orbital. 
Bohr’s theory states that the electron would be in a fixed circular orbit round the centre 
of mass with exactly this radius! As the principal quantum number increases, so does the 
average value of the radial distribution function. Elementary chemistry texts attach much 
importance to diagrams of these kinds. 

There are other ways to visualize the atomic orbitals. Figures 13.4 and 13.5 show contour 
diagrams for the Is and 2s hydrogen atomic orbitals. The particular software package I used 
marks the axes with somewhat arbitrary units depending on the range of points chosen, 
although the contour values themselves are correct and the nucleus is at the centre of the 
plane. 



Figure 13.4 Hydrogen Is contour diagram (nucleus at centre) 


13.7.2 The p-orbitals 

For angular momentum quantum number 1=1, there are three possible values of m h 
namely —1, 0 and +1. The angular factors for m, = ±1 are complex (they involve the 
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Figure 13.5 Hydrogen 2s contour diagram 


square root of — 1 in an exponential factor) and it is usual to make use of the de Moivre 
theorem in order to visualize the orbitals: 

cos (j> = X - (exp (jcj>) + exp (-/</>)) 

, (13.31) 

sin <P= — (exp (/'</>) - exp (-;<£)) 

2 J 


The real equivalent p-orbitals are therefore taken as linear combinations of m ( = +1 and 
— 1. Despite the name, they are not completely equivalent to the complex orbitals as only 
the latter are eigenfunctions of the z component of the angular momentum. Luckily, this 
only matters in the presence of an external magnetic field. The real equivalent 2p and 3p 
orbitals are shown in Table 13.3. 

These are usually represented as contour diagrams; the orbital shapes are memorable 
because they are equivalent and point along the appropriate axes. Figure 13.6 shows a 
hydrogenic 2p z orbital, which points along the vertical z axis. 
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Table 13.3 The first few p-orbitals 


n, 1, m 

Symbol 

Normalized wavefunction 

2, 1, ±1 

2p* 

ss' 

g) 

p sin (6) cos (0) exp 00 


2Py 


gy 

P sin (9) sin (0) exp 00 

2, 1, 0 

2Pz 

4^/Tn ^ 

g) 

p cos (0) exp (-0 

3, 1, ±1 

3p* 


gy 

(6p - P 2 ) sin (0) cos (0) exp 00 


3Py 

81 v'SF ' 

g) 

(6p - p 2 ) sin (0) sin (0) exp 00 

3, 1, 0 

3pz 

8lgS^ 

gy 

(6p - P 2 ) cos (0) exp 00 



Figure 13.6 Hydrogen atom 2p z orbital 
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13.7.3 The d-orbitals 

Similar comments apply to the / = 2 orbitals. There are five in all, four of which come in 
complex pairs. We combine together the corresponding values of mi and — m, just as for 
the p-orbitals to give real equivalents. Table 13.4 records the 3d orbitals. 


Table 13.4 The 3d orbitals 


n, I, mi Symbol Normalized wavefunction 



Once again these are usually represented as contour diagrams (Figures 13.7 and 13.8). 


13.8 The Stern-Gerlach Experiment 

If we suspend a compass needle in the Earth’s magnetic field, it aligns itself along the 
magnetic field lines. A compass needle is an example of a magnetic dipole, and the strength 
of the interaction between the compass needle and this external magnetic field is determined 
by the magnetic dipole moment p,„. This is a vector quantity pointing by convention from 
the south pole of the needle to the north pole. The interaction between the dipole and the 
field is determined by the magnetic potential energy 


U m = — Pm-B (r) 


(13.32) 


where the position vector r is the position of the dipole and B the magnetic induction. If B 
is uniform and so does not depend on r, the gradient is zero and so the force is also zero, 
in accord with experiment. 

Around 1820, Oersted discovered experimentally that electric currents could exert 
forces similar to those exerted by permanent magnets, for example on a compass needle. 
Figure 13.9 shows a simple current loop located in the xz plane and carrying steady current 
I. If the loop is flat, the dipole is perpendicular to the plane and if we consider points on 
the axis far away from the loop it turns out that the magnitude of p m is 


p m = IA 

where A is the area of the loop (which need not be circular). 
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Figure 13.7 Hydrogen 3d zz orbital 


It is easy to demonstrate from classical physics that a particle with mass M, charge Q 
and angular momentum 1 is a magnetic dipole: 



According to Bohr’s model of the hydrogen atom, the allowed electron orbits each had 
an angular momentum that was an integral multiple of h/2it. Since magnetic dipoles are 
linked to angular momentum, the possibility arises that if we could measure the mag¬ 
netic dipole moments of individual atoms we could investigate the quantization of angular 
momentum. In the presence of an external magnetic field, the bar magnet will take up one 
of its possible alignments with the axis of quantization (the direction of the magnetic field 
lines). 

Stem and Gerlach (1921) performed the first and most famous experiment designed to 
investigate the quantization of angular momentum. Whilst the force on a magnetic dipole is 
zero in a uniform magnetic field, the force is not zero for a nonuniform field. It is difficult to 
hang a single atom between the poles of a magnet, so a beam of atoms was passed through 
such a field and the deflection of the beam measured. 
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to battery 


Figure 13.9 Current loop 
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A schematic diagram of the Stern-Gerlach equipment is shown in Figure 13.10. Abeam 
of atoms is produced in an oven, and they pass through a collimating slit and often a 
velocity selector in order to form a mono-energetic beam. They then travel in the x direction 
through a long, powerful electromagnet whose pole pieces are deliberately shaped to give a 
nonuniform magnetic field in the z direction. Detailed analysis shows that the force on the 
magnetic dipoles in the z direction is given by a force proportional to the z component of the 
magnetic dipole moment (and hence the z component of the angular momentum vector): 

dB z 

F z =Pm^ (13.34) 

3 z 

It is a difficult experiment to perform satisfactorily, for the atoms collide with each other 
in the beam and so distort the result. An essential part of such an atomic beam apparatus 
is that there should be a good vacuum system, and very low beam intensities have to be 
used. Despite the experimental difficulties, the findings were positive; the atom beam was 
shown to split on passage through the magnetic field and this confirmed the quantization 
of angular momentum. 



Figure 13.10 Stern-Gerlach experiment 


13.9 Electron Spin 

The first experiments were made with silver atoms. The surprising result of the Stern- 
Gerlach experiment was not that the beam split, but that it split into two components, with 
a different separation than that expected. Similar results were obtained for copper and gold, 
and in later work for the alkali metals and for hydrogen atoms. The point is that an atomic 
dipole moment characterized by quantum number l should show 21 + 1 orientations with 
the magnetic field. Since the quantum number takes values 0,1, 2,... there should always 
be an odd number of orientations and hence the atom beam should split into an odd number 
of components. 

The explanation came in 1925, when Goudsmit and Uhlenbeck (1925) analysed the 
splittings of spectral lines occurring when atoms are subject to an external magnetic field (the 
Zeeman effect). They showed that the measured splittings could be explained if electrons 
were postulated to have an intrinsic magnetic moment, in addition to the one they acquired 
by motion about the atomic nucleus. In order to explain the experimental results, they 
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assumed the existence of a spin angular momentum vector s, which had similar properties 
to 1 above. In order to account for the splitting into two beams, they postulated a spin 
quantum number for electrons s of y 2 and so a spin magnetic quantum number m s of 
±y 2 . The spin wavefunctions are usually written a (corresponding to the +*/ 2 spin quantum 
number) and P (corresponding to the - l / 2 quantum number). So we write, just as for ordinary 
angular momentum 


s 2 p 

s z a 

u 


1 (\ \ h 2 

2 (2 + 7 4jt 2 ' 


~ U + l 7-7. 


4 n 2 

h 2 R 

4 n 2 ^ 


2 2n 

-~-P 

2 2n 


(13.35) 


A problem appeared once the splittings were analysed. According to classical theory, the 
spin magnetic moment of an electron should be 


Pspm 


2 OT e 


(13.36) 


whilst to get agreement with the splittings, an extra factor of (almost) 2 was needed. The 
solution to the problem was the introduction of the g-factor, an experimentally determined 
quantity that made Equation (13.36) correct. Thus we write 



The electronic g e factor is another of physical science’s very accurately known constants, 
having a value of g e = 2.0023193043787 ± (82 x 10 -12 ) 


13.10 Total Angular Momentum 

When orbital angular momentum and spin angular momentum both exist in the same atom, 
the magnetic moments that result from these two angular momenta interact to cause a 
splitting of the energy level. The interaction that leads to this splitting is called spin- 
orbit coupling and it couples the two into a resultant total angular momentum. A simple 
vector model that is very similar to the model used to describe orbital angular momentum 
can describe this. According to this model, the total angular momentum of an electron is 
characterized by a quantum number j. For any given nonzero value of l the possible values 
of j are given by 

j = l±s 

(The use of j as an atomic quantum number is not to be confused with the use of j for the 
square root of —1). The rule is that the quantum number j must always be positive, so if 
l = 1 then j = 3/2 and 1/2, but if / = 0 then we only have j = 1/2). 
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I have emphasized earlier the interplay between spectroscopic data and theoretical 
developments; even before Schrodinger’s time many highly precise spectroscopic data 
had been obtained experimentally. The traditional reference sources for such data are the 
three volumes of tables in the Atomic Energy Levels series (Moore 1949, 1952, 1958). 
In the past few years, the National Institute of Standards and Technology (NIST) atomic 
spectroscopists have made available a unified comprehensive Atomic Spectra Database 
on the World Wide Web, which contains spectral reference data for 91 000 wavelengths 
(http://www.nist.gov). The sample in Table 13.5 is taken from the database. Note that the 
data contains spectroscopic term symbols for each level, which are discussed in all ele¬ 
mentary physical chemistry undergraduate texts. Spectroscopists traditionally deal with 
term values rather than energies; these are just e/hc 0 . 


Table 13.5 Internet search for atomic term values 


Configuration 

Term symbol 

j 

Term value/cm 1 

Is 1 

2 S 

1/2 

0 

2p' 

2 P 

1/2 

82 258.9206 


3/2 

82 259.2865 

2s 1 

2 S 

1/2 

82 258.9559 

3p' 

2 P 

1/2 

97 492.2130 


3/2 

97 492.3214 

3s 1 

2 S 

1/2 

97 492.2235 

3d 1 

2 D 

3/2 

97 492.3212 



5/2 

97 492.3574 

4 P ' 

2 P 

1/2 

102 823.8505 


3/2 

102 823.8962 

4s 1 

2 S 

1/2 

102 823.8549 

4d’ 

2 D 

3/2 

102 823.8961 



5/2 

102 823.9114 

4fi 

2 F 

5/2 

102 823.9113 



7/2 

102 823.9190 


13.11 Dirac Theory of the Electron 


There is no mention of electron spin from the Schrodinger equation, and certainly no clue 
as to why the classical equation for magnetic moments is in error by (roughly) a factor 
of 2 when applied to electron spin (Equation (13.36)). If we consider the time-dependent 
Schrodinger equation for a free electron 


h 2 / 9 2 d 2 

87r 2 m e \9x 2 dy 2 


(13.37) 


(where j is the square root of — 1), it is seen to be a second-order partial differential equation 
with respect to the spatial coordinates and a first-order partial differential equation with 
respect to time. It therefore is not consistent with the special theory of relativity, which 
requires that time and space should enter such equations on an equal footing. If the equation 
is second order in space, it should also be second order in time. 
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Erwin Schrodinger decided that the way ahead was to abandon the classical energy 
expression and start again with the relativistically correct equation 

e 2 = m 2 Cg + c 2 p 2 (13.38) 

He then made the operator substitutions 


j 2tt dt 

to arrive at the Klein-Gordan equation 


.h_d_ 

3x 


\3x 2 3 y 2 dz 2 cl dt 2 h 2 ) 

In discussing relativistic matters, it is usual to write equations such as this 'm four-vector 
notation. We define a four-vector with components 


(13.40) 


o that Equation (13.39) becomes 


\3X| 


3xf dx? dxl 


4 Tz 2 m 2 t i 


V (x u x 2 ,x 3 ,x 4 ) = 0 


The Klein-Gordan equation is more satisfactory in that it has the desirable symmetry but 
it turns out that it cannot describe electron spin. In the limit of low energy, it is equivalent 
to the familiar Schrodinger equation. 

Paul Dirac had the ingenious idea of working with a relativistic equation that was linear 
in the vector components. He wrote 

' 3 3 ” <*„*.».*) = 0 (13.41) 




dXi 


Ey 2 


dx 2 


E y 3 


dx 3 


E y 4 


where the multipliers y, have to be determined. This equation is called the Dirac equation. 
Both the Schrodinger and the Klein-Gordan equation are second order, and it is usual to 
manipulate the Dirac equation in order to give a corresponding second-order equation. This 
can be done by operating on Equation (13.41) with the operator 


Yi 


3x, 


l- y 2 


dx 2 


Ey 3 


dx 3 


b y 4 


2 jrm e c 0 


3x 4 


A little operator algebra shows that the multipliers have to satisfy 
f 2 if 7=y 

r,rj + Y,r,= j 0 xttj 
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It can be shown that any particle whose wavefunction satisfies the Dirac equation must 
be a spin - l / 2 particle. Not only that, the Dirac treatment gives the correct value for the 
magnetic moment in that it gives 



13.12 Measurement in the Quantum World 

The process of measurement in quantum mechanics is subtler than in classical mechanics. 
As discussed in Appendix A, the possible results of measurements depend on the eigenvalues 
of the appropriate operators. Also, if we wish to make simultaneous measurements of two 
observables (such as the linear momentum and position, or two components of an angular 
momentum vector), we have to take account of Heisenberg’s uncertainty principle. Certain 
pairs of observables can be measured simultaneously to arbitrary precision whilst other 
pairs cannot. 

There is worse to come, for the word measurement gets used in two different ways in 
quantum mechanics. Suppose we have a one-electron atom as discussed above; we know 
that the energies are found from Schrddinger’s equation, which I will write in Hamiltonian 
form as 

Hft = Eifi 

If the atom is in state i/r,- then repeated energy measurements on the same atom will always 
yield the same result, s t (we say that the system is in a pure state). 

If on the other hand we pass a beam of electrons through a Stem-Gerlach apparatus, the 
magnet separates the beam into two components that correspond to the two spin eigenvalues 
m, = +'/ 2 and m s = — y 2 . This kind of measurement is referred to as state preparation for 
if we pass the m s = +y 2 beam through a further Stem-Gerlach apparatus oriented in the 
same way as the first, we simply observe the one beam. I have shown this schematically in 
Figure 13.11. 




SG 


SG 


Apparatus 


Apparatus 





Figure 13.11 Repeated Stern-Cerlach (SC) measurements 


However, electrons in the incident beam are not in pure states, and their spin wavefunction 
can be written as linear combinations of the two spin functions a and fi: 

ir = au +bp 

They are said to be in mixed states. Here, a and b are scalar (complex) constants. The 
question is how do we interpret the measurement process? For example, does a given 
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electron in some sense know which spin state it should be in, before passage through 
the apparatus? Similar questions puzzled Schrodinger and his contemporaries, and led 
Schrodinger to state the famous cat paradox. In a version of this thought experiment, a 
cat is placed in a closed box together with (for example) a radioactive atom that releases a 
deadly poison when it decays. If the wavefunction of the living cat is Vh and that of a dead 
cat iAd then the state of the cat at time t can be described by 

lAcat (?) = C L it) + C D (?) lAn 

The coefficients are time dependent, as is the wavefunction. Before the box is closed, the 
cat is alive. Once the box is closed, the cat is apparently neither dead nor alive; it is in a 
mixed state. On opening the box at a certain time, the cat is either dead or alive, but is it 
the act of measurement that has forced the cat into one state or the other? 

There are whole rafts of similar questions that can be asked. Most of the difficulties can be 
resolved by recognizing that we are asking a statistical question, not one about a specific cat. 
If we prepare a large number of identical experiments, with identical cats and amounts of 
radioactive isotopes, close the lids on the boxes and then examine a statistically significant 
number of boxes, we will find that a fraction |c D (?) | 2 will have died and a fraction |c L (?)] 2 
will be alive, but no prediction can be made as to the state of a given cat in a given box. 
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The Orbital Model 


Our next step is to consider a many-electron atom, such as that illustrated in Figure 14.1. 


o 

Nucleus 





Figure 14.1 Many-electron atom 


I am going to make the ‘infinite nuclear mass’ approximation, and only consider the 
electronic problem. The atom is therefore fixed in space with the nucleus at the centre 
of the coordinate system. The n electrons are at position vectors ri, r 2 ,..., r„ and the 
scalar distance between (say) electrons 1 and 2 is r 12 in an obvious notation. The elec¬ 
tronic wavefunction will depend on the coordinates of all the electrons, and I will write it 
W (r,, r 2 ,..., r„) If the nuclear charge is Ze, we have to consider three contributions to the 
electronic energy: the kinetic energy of each electron, the Coulomb attraction between the 
nucleus and the electrons and finally the Coulomb repulsion between pairs of electrons. We 
therefore write the Hamiltonian operator as 
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a -5(-^(£ + S + £)-^) + §S 


e 2 

47T e 0 rij 


14.1 One- and Two-Electron Operators 


In order to stop the notation becoming unwieldy, I will group these terms as follows. Each 
term in the first bracket refers to the coordinates of the individual electrons, and gives the 
kinetic energy of each electron together with its attraction to the nucleus. I will call such 
terms one-electron operators and authors normally write them as 


h m (r,-) = ^5 + fr 


Ze 2 
4 7te 0 ri 


(14.2) 


Each term in the second double summation gives the Coulomb repulsion of a pair of 
electrons; I will refer to them as two-electron operators and write 


gfa’Tj) 


4n s 0 rij 


The Hamiltonian is then, in our compact notation 


(14.3) 




(14.4) 


and we wish to investigate the solutions of the Schrodinger equation 

H& (r,,r 2 ,...,r„) = e# (r,,r 2 ,... ,r„) (14.5) 


14.2 Many-Body Problem 

The many-electron atom is an example of a so-called many-body problem. These are not 
unique to quantum theory; a familiar example is planetary motion. Newton’s equations 
of motion can be solved exactly for the motion of any one of the planets round the sun 
individually, but the planets also attract each other. During the eighteenth and nineteenth 
centuries a great deal of effort was expended trying to find an exact solution to planetary 
motion, but all efforts failed and it is generally accepted that exact solutions do not exist, 
even for just three bodies. Astronomers are lucky in the sense that the gravitational force 
depends on the product of two masses and the mass of the sun is much greater than the masses 
of the individual planets. Ingenious techniques were developed to treat the interplanetary 
attractions as small perturbations, and the planetary problem can be solved numerically to 
any accuracy required. 

To continue with the planetary motion analogy, chemists are less lucky in one sense 
because the electrostatic force depends on the product of two charges and in the case of an 
electron and a proton these forces are roughly equal in magnitude. It will therefore come 
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as no surprise when I tell you that exact solution of the many-electron atomic Schrodinger 
problem seems to be impossible, because of the electron-electron repulsions, and that we 
are apparently in a weaker position than the astronomers. On the positive side, I have 
stressed the Born interpretation of quantum mechanics. Here we do not focus attention 
on the trajectories of the individual particles, rather we ask about the probability that a 
region of space is occupied by any one of the particles. The quantum mechanical problem 
therefore seems to be more hopeful than the astronomical one. 


14.3 Orbital Model 

Let me now investigate the Schrodinger equation that would result from an atom consisting 
of electrons that did not repel each other. We can think of this as some ‘zero-order’ approx¬ 
imation to a true many-electron atom, just as the astronomers might have investigated their 
simple model of planetary motion. 

We therefore write 

H'P (r 1; r 2 ,.. ,r„) = e# (r^rj,... r„) 

(^2 ~ hW fa) j * fa. r 2 > • • • r „) = e# fa,r 2 ,... r„) 

and this appears to be a candidate for separation of variables. I write 


V (r lt r u ...r l ) = fi (r,) f 2 (r 2 ) (r„) (14.6) 

Substitution and separation gives n identical one-electron atom Schrodinger equations, and 
so the total wavefunction is a product of the familiar Is, 2s, 2p atomic orbitals discussed 
in Chapter 13 (with nuclear charge Ze). The energy is given by the sum of the orbital 
energies. 

We then have to take account of electron spin and the Pauli principle, as discussed in 
Chapter 13.1 can remind you of the principles by writing down some of the lowest energy 
solutions for helium, in particular those that formally involve the Is and 2s orbitals. I will 
label the atomic orbitals Is and 2s for obvious reasons, and I will adopt the habit of writing 
for the spin variable (sorry about the double use of the same symbol) as in Chapter 13. 

The allowed wavefunctions are given in Table 14.1. 

'P ] describes the spectroscopic ground state, giving energy 2e u . The remaining wave- 
functions describe excited states. 'i J 2 is the first excited singlet state whilst and 

are the three components of the first triplet state. We refer to W 2 through as singly excited 
wavefunctions because they have been formally produced from the ground-state wavefunc¬ 
tion by exciting a single electron. Due to my neglect of electron repulsion, the energies of 
\P 2 through t^ 5 are the same. d> 6 is a doubly excited wavefunction and so on. 

Our zero-order model predicts that the singlet and triplet excited states derived from a 
Is 1 2s 1 orbital configuration will have energy 
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Table 14.1 Some electronic states for helium 


State 

Spatial part 

Spin part 

Energy, s 


1 s(r, )1 s(r 2 ) 

^={a(s,)IKs2)~a(szms t )) 

2e u 

<I> 2 

-^=(1s(r,)2s(r 2 ) + 1s(r 2 )2s(r,)) 

^=(a( Sl )^(s 2 )-a(s 2 )^(s,)) 

®ls + ®2s 

W 3 


a(Si)a(s 2 ) 

Sis + S 2s 


-!=(ls(r,)2s(r 2 ) - 1 s(r 2 )2s(r,)) 

-^=(a(s,)P(s 2 ) + afeljS/s,)) 

®ls + S 2s 

% 


/KsO/ife) 

®1s + S 2s 

4> b 

2s(r 1 )2s(r 2 ) 

-J=(«(s 1 )/i(s 2 )-a(s 2 )/7(s 1 )) 

2s 2s 


above the ground state. Since Z = 2forheliumwecalculateawavenumberof329212cm 1 . 
Experimental data can be found at the NBS/NIST website http://www.nist.gov as given in 
Table 14.2. 


Table 14.2 Experimental data for helium 


Configuration 

Term 

J Term value/cm 1 

Is 2 

’S 

0 0 

Is^s 1 

3 S 

1 159 856.07760 

Is^s 1 

'S 

0 166 277.542 


I should explain that the J quantum number is similar to the j quantum number we 
met in our study of one-electron atoms in Chapter 13; for light atoms such as helium 
it is determined by combining together the individual electron orbital and spin angular 
momentum quantum numbers, according to a set of well-known rules called the Russell- 
Saunders scheme. We combine together the l quantum numbers for the two electrons; 
since / = 0 for an s electron, the allowed resultant L is also 0. The electronic states are 
therefore S states. We also combine together the spin quantum numbers s. Since s = '/ 2 
for an electron, the allowed values of the resultant S are % + v /% and l /% — y 2 and the spin 
multiplicities 25 + 1 are 1 and 3. We then combine the L’s and the S ’s in the same way 
to get J. 

The zero-order model is not even qualitatively correct; it overestimates the energy dif¬ 
ference between the ground state and the excited states, and has nothing at all to say about 
the experimentally interesting difference between the singlet and the triplet excited states. 
This poor agreement with experiment is mostly due to our neglect of electron repulsion 
but is in part due to the fact that each electron shields the other electron from the Coulomb 
attraction due to the nucleus. 
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14.4 Perturbation Theory 

There are very few physically interesting problems that we can solve exactly by the standard 
methods of quantum theory. The great majority of problems, including those of atomic and 
molecular structure, must therefore be tackled by approximate methods. 

Suppose that our problem is to solve, for example, the helium atom electronic Schrodinger 
equation 

HMi r 1 ,r 2 )=e i lPi(r 1 ,r 2 ) (14.7) 

We might suspect that this problem is similar to that of two superimposed hydrogen atoms, 
for which we can find exact solutions to the zero-order Schrodinger equation 

(r 1? r 2 ) = sfV® (r la r 2 ) (14.8) 

The aim of perturbation theory is to relate the solutions of problem (14.7) to the exact 
zero-order solutions of (14.8). To simplify the notation, I will drop all references to two 
electrons; perturbation theory is a general technique, not one that is specific to helium. 
It is also general in that it can be applied to every solution not just the lowest energy 
one. There are two technical points. First, I am going to assume that the state of interest 
is not degenerate. There is a special version of perturbation theory that is applicable to 
degenerate states, and I refer the interested reader to the classic texts such as Eyring et al. 
(1944). Second, I am going to assume that the wavefunctions are real rather than complex. 
It makes the equations look a bit easier on the eye. 

We proceed as follows. First we write the Hamiltonian as 

H = H i0) + XH m (14.9) 

where X is called the perturbation parameter. The second term is called the perturbation. 
We assume that the energies and wavefunctions for our problem can be expanded in terms 
of the zero-order problem as 

V t = t^ (0) + XV™ + X 2 V™ + ■■■ 

(14.10) 

Si = e- 0) + Ae- 1 ’ + X 2 ef 2) + ■ ■ • 

The superscript ‘(k)’ refers to the order of perturbation theory, and the equation should 
demonstrate why the perturbation parameter X is added; it is a formal device used to keep 
track of the ‘orders’ of the perturbation. It might physically correspond to an applied electric 
field (as in the Stark effect) or an applied magnetic induction (as in the Zeeman effect, in 
which case we need to use complex wavefunctions). If we substitute the expansions into 
our problem we find 

H (0) V ( o) + X (ff (1) V. m + H m V m ) + X 2 (H m V m +H (f>) V ( - 2) \ + ■■■ 

v ; v > (14.11) 

m + X (efV/ 05 + efv™) + X 2 (efV. (0) + e^V™ + ef 4- 
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In order that Equation (14.11) may be true for all values of X, the coefficients of X on 
either side of the equation must be equal. Equating and rearranging we find 


w (0 V/ 0) = 

(£ (0) _ e f») = ( e 0) _ fftDj (14.12) 

( H (0) - 4 0) ) ^- (2) = efV® + s, a y. (1) - //'V/ 15 


Solution of these equations gives 


IjWrrl 1 ) 

4-^ -89 _ „(0 


^ = ^,. (0) + A J]- 


(14.13) 


I have used the following shorthand for the integrals: 

H^P = J ^H m Wj <0) dr (14.14) 

The first-order correction to the energy can therefore be calculated from the unperturbed 
wavefunction and the perturbing Hamiltonian: 


= hP 


J dr 


On the other hand, the second-order correction to the energy requires knowledge of the 
remaining states 


!, = £ 


(s*r. 


If we return to the helium atom where the zero-order problem is two noninteracting 
atomic electrons, the zero-order wavefunctions and energies are shown in Table 14.1. The 
perturbation is the Coulomb repulsion between the two electrons 


4tt£o r 12 


and the first-order correction to the ground-state energy is 




) Vi dr 

47T£o r u J 


where is given in Table 14.1. 

Evaluation of the integral is far from easy, since it involves the coordinates of both 
electrons (it is therefore a six-dimensional integral), and it has a singularity (it tends to 
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infinity as the two electrons approach each other). It is shown in the classic texts such as 
Eyring et al. (1944) that 

(i) _ 5Z e2 

8 4jrs 0 a 0 

s m _ 5Z e 2 

hc 0 8 hc 0 47Te 0 a 0 
= 274 343 cm" 1 

and my revised estimate of the first ionization energy is now 164606 cm -1 , in better 
agreement with experiment. 


14.5 Variation Method 


Another, completely different method of finding approximate solutions to the Schrodinger 
equation is based on the following theorem. 


Theorem 

If the lowest eigenvalue of a Hamiltonian H is s and <T> is a function with the correct 
boundary conditions, then 

/ 0*H<P dr 
f 0*0 dr ~ S 

Once again, the proof is given in all the classic texts such as Eyring et al. (1944). I can 
illustrate the use of this technique by reference to the helium atom. The spatial part of the 
zero-order wavefunction is 

*(r„r,)=!^xp(-Z^) 


where Z is the atomic number (2 in this case). We now allow for the possibility that one 
electron partially screens the nucleus and so the second electron sees a reduced nuclear 
charge Z'. This suggests that we use a trial wavefunction that comprises a product of 
modified helium Is orbitals with effective nuclear charge Z'e 


^ . (Z') 3 / , (f \ + r 2 ) \ 

® (ri,r 2 ) = — r exp -Z- 

nai \ a 0 ) 


We would expect Z' to lie between 1 and 2 and we look for the value of Z' that makes 
the variational integral 


f 0 (r u r 2 ) (r,) + (r 2 ) + g (r,,r 2 )) 0 (r,, r 2 ) dr,dr 2 

/ 0 2 (r l5 r 2 ) dri dr 2 


(14.16) 


a minimum. I have dropped the complex conjugate signs * because of the real nature of the 
wavefunction. The denominator is 


/ 0 2 (r,, r 2 ) dTjdrj = / Is 2 (r,)dT, / H 2 (r 2 ) dr 2 
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The numerator is 

e' = J 0 (r l5 r 2 )h m (rO 0 (r,, r 2 ) dr, dr 2 + J 0 (r x , r 2 )h m (r 2 ) 0 (r x , r 2 ) dtj dr 2 

+ J & (ri, r 2 )g (r 1( r 2 ) 0 (r,,r 2 ) dr, dr 2 

r . r r . < 14 ' 17 ) 

s' = / Is (r,)/j (1) (r,) Is (r, ) dr, / Is 2 (r 2 )dr 2 + / Is (r 2 )h m (r 2 ) Is (r 2 ) dr 2 

x f Is 2 (r,)dr, + f Is (r x )ls (r 2 ) — Is (r,) Is (r 2 ) dr, dr 2 
J 4jre 0 J r n 

The first two integrals are related to the energy of a 1 s orbital of He + ; each integral is equal 
because of the indistinguishability of the two electrons, and I can write each one as 

/ (I? + It 5) - 

+ / is ^^r) is<r ‘ )dr ' 

=4 ( z ') ,£ >+/ u <■■■> (^r) ls <■•■) dT ‘ 

= - i (Z') 2 £ h + Z'(Z' - Z)£ h 

Adding the two-electron integral from above we find a variational energy of 

e = ^(Z') 2 - 2ZZ' + ^Z'^ E b (14.18) 

We obtain the best approximation to the true energy by giving Z' the value which will 
make the energy a minimum. This means 

§=(2Z'-2Z+0i5, 

= 0 

and so the best energy results when we take 

Z ' = Z - — 

16 

This leads to an improved ground-state energy of 


By introducing more and more parameters like Z' into the wavefunction 0 we can 
approach more and more closely the experimental result. Hylleraas (1930) experimented 
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with wavefunctions that were explicit functions of the interelectron coordinate r 12 . His first 
attempt was to write 

<P= A (exp (—Z' (n + r 2 )) (1 + cr 12 )) (14.19) 

where A is the normalization constant and Z' and c are adjustable parameters. In later work 
he made use of expansions such as 

0 = A (exp (—Z' (r, + r 2 )) (polynomial in r u r 2 , r 12 )) (14.20) 

and was able to demonstrate impressive agreement with experiment. We refer to such 
wavefunctions as correlated wavefunctions; as we will soon see, the most sophisticated 
orbital models average over the electron interactions whilst correlated wavefunctions allow 
for the ‘instantaneous’ interaction between electrons. 

Some of Hylleraas’s results are shown in Table 14.3; the most accurate theoretical value 
known is that of Frankowski and Pekeris (1966). 


Table 14.3 Hylleraas's variational results for helium 


Approximate wavefunction 

e/E h 

exp (—k (r, +r 2 )) 

-2.8478 

exp (- k (r, + r 2 )) exp (-c, r, 2 ) 

-2.8896 

exp (-k (r, + r 2 j) exp (-c, r 12 ) cosh (c (r, - r 2 )) 

-2.8994 

ex P ( - 2 r2 )) ( C ° + Clh + Ci ^ ~ + Cl ^ r ' + r 2) 

-2.9032 

+ c 4 (r, + r 2 ) 2 + c 5 r, 2 2 ) 


'Exact' 

-2.90372437703 


Hylleraas’s approach whereby we write the interelectron distances explicitly in the wave- 
function (and so abandon the orbital model) gives by far the most accurate treatment of 
atomic systems, but like most attractive propositions there is a catch. I glossed over calcu¬ 
lation of the two-electron integral in the discussion above, but this causes a major problem 
for molecular systems. Any attempt to use hydrogenic orbitals for a molecular system 
leads to two-electron integrals that are impossibly hard to evaluate. Despite many attempts, 
Hylleraas’s method has never been successfully applied to a large polyatomic molecule. 


14.6 Linear Variation Method 

The variation principle as stated above applies only to the lowest energy solution of any 
given symmetry (spatial and spin). A special kind of variation function widely used in 
molecular structure theory is the linear variation junction. For the sake of argument, suppose 
we try to improve the helium atom ground-state wavefunction 0] by adding 0 2 through 0 (l 
(given in Table 14.1) and so write 


0 = C\ 0\ + c 2 0 2 + • ■ • + c 6 0 6 
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We keep the effective nuclear charge Z' constant throughout the calculation (since it 
is a nonlinear parameter), but seek the values of the linear parameters Ci through c 6 that 
minimize the variational integral 

_ f 0*Hd> dr 
e ~ f 0*0 dr 

For neatness, I will recast the problem in matrix notation and treat the general case with 
n rather than six <^’s. We write 


0 = ( ^ & 2 ■■■ ^n) 


I will also collect integrals such as 

Hy = J 0*H0j dr and S v 

into the n by n matrices H and S. I will generally work with real rather than complex 
wavefunctions, so we can drop the complex conjugate sign * and the variational integral 
becomes 

c T Hc 

c T Sc 

We now let c change by a small amount 5 c and find the change in e (which we will then 
set to zero for a stationary point). We have 

(c + <5 c) t H (c + 5c) 

(c + 5c) S (c + 5c) 

_ c T Hc + (5 c) t He + c T H5c + (5c) T H5c 
" c T Sc + (5c) T Sc + c T S5c (5c) T S5c (14.21) 

c T Hc + (5 c) t He + c t H5c + (5c) T H5c 
_ c T Sc (1 + ( ^ c)T ^ c i cT S <s > c (5 c) t S5c\ 

cSc { l + ~^ + ^ + ~^sT) 

Expanding the denominator by the binomial theorem and retaining only first order in 5 c 
we find 

, + = ( CTHC + (fa)T " C + + "i (l - »> T ^ + + ..(14.22) 

c T Sc \ c T Sc / 

and after multiplying out 

(e + 8e) c T Sc = (5c) T (He - eSc) + (c T H - ec t S) 5c + • • • (14.23) 

The two terms on the right-hand side are simply matrix transposes of each other; they 
carry the same information. I did not place any requirements on 5c: it is quite arbitrary and 
so for a stationary point we must have 


V c„ / 

J d 


He = eSc 











The Orbital Model 225 


This is called a generalized matrix eigenvalue problem and there are exactly n possible 
solutions Ci, Bp, c 2 , £2', ■ ■ ■ c„, e„. Each of the n solutions is an upper bound to n electronic 
states (including the ground state), and so the linear variation method has the added bonus 
of giving approximations to each of n states simultaneously. Not only that, but if we add 
further 4^’s and repeat the calculation, each of the energies will at worst stay the same or 
possibly get closer to the true energy. I have illustrated MacDonald’s theorem for the case 
n = 6 in Figure 14.2. 



Figure 14.2 MacDonald's theorem 


Addition of <^ 7 has had no effect on the n — 6 approximation for <P 3 but has (for example) 
lowered the energy of the tfq approximation. 

I have stressed the special property of atoms; they are spherical and their electronic 
Hamiltonian commutes with the square of the angular momentum operator, together with 
the z component. The Schrodinger equation makes no mention of electron spin and so the 
Hamiltonian also commutes with the square of the spin angular momentum operator and 
the z component. In Table 14.4,1 have summarized the relevant quantum numbers for the 
six states of helium considered. 


Table 14.4 Angular momentum quantum numbers for helium 
states 



L 

M, 

s 

M s 


0 

0 

0 

0 

V 2 

0 

0 

0 

0 

4/3 

0 

0 

1 

1 

oq 

0 

0 

1 

0 

V 5 

0 

0 

1 

-1 

O'6 

0 

0 

0 

0 
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It can be shown (e.g. Eyring etal. 1944) that if 'P l and 'Pj are eigenfunctions of an operator 
A that commutes with the Hamiltonian H, and that the eigenvalues are different 

A4>, = 

APj = cijPj 
a i a j 


then 


J pfip, dr = 0 


This means that the Hamiltonian matrix will have a simple form 

/ H n H u 0 0 0 H l6 \ 

H 2I H 22 0 0 0 H 26 

0 0 #33 0 0 0 

0 0 0 H44 0 0 

0 0 0 0 H 55 0 

H a H 62 0 0 0 H 66 

so instead of a 6 x 6 matrix eigenvalue problem, we have a 3 x 3 and 3 @ 1 x 1. There is 
nothing particularly hard about solving a 6 x 6 matrix eigenvalue problem, but this simple 
example demonstrates how angular momentum can be used to help with atomic calculations. 
In the case of polyatomic molecules, things are not so easy. Angular momentum operators 
do not generally commute with molecular Hamiltonians and so molecular problems are 
much harder than atomic ones. Spin operators commute with molecular Hamiltonians, as 
do symmetry operators. 


14.7 Slater Determinants 

Solution of the appropriate electronic Schrodinger equation is only one aspect of a calcu¬ 
lation; we have to also take account of electron spin and because electrons are fermions 
the electronic wavefunction has to satisfy the Pauli principle. Neither electron spin nor the 
Pauli principle appears from the Schrodinger treatment. As I mentioned in Chapters 12 and 
13, Pauli’s principle can be stated in a number of different ways; I am going to restate it as 

Electronic wavefunctions must be antisymmetric to exchange of electron names. 

I produced the helium orbital wavefunctions in Table 14.1 by a somewhat ad hoc method; 
I constructed suitable spatial parts and spin parts, which I combined together in such a 
way that the Pauli principle was satisfied. A more systematic method for constructing 
antisymmetric orbital wavefunctions is needed. 

Electron spin can be conveniently treated by combining spatial orbitals with the spin 
functions a and /3; for a given spatial orbital i//(r) we work with two space and spin 
wavefunctions that we write \jr(r)a(s) and \lr{r)fi(s)\ these are usually called spinorbitals 
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and electrons are allocated to spinorbitals. A spinorbital can hold a maximum of one electron. 
Suppose then that we have four electrons that we wish to allocate to the four spinorbitals 
f A (r)a(s), i/r B (r )a(s), f A (r)P(s) and ^r B (r)P(s). One possible allocation is 

f A (r,) a (.v,) f A (r 2 ) p (s 2 ) f B (r 3 ) a (s 3 ) f B (r 4 ) P (s 4 ) 

but we must now allow for the indistinguishability of the electrons and take account of all 
the remaining 4! - 1 permutations of electrons through the spinorbitals, for example 

f A (r 2 ) a (s 2 ) f A (r,) P (.v,) f B (r 3 ) a (.s 3 ) f B (r 4 ) p (s 4 ) 

which has been obtained from the original allocation by permuting the names of electrons 
1 and 2. All the 4! permutations have to appear with equal weight in the total wavefunction, 
and in order to satisfy the Pauli principle we multiply each of them by the sign of the 
permutation. This is — 1 if we permute an even number of electrons, and +1 if we permute 
an odd number. A total orbital wavefunction that satisfies the Pauli principle will therefore be 

V = f a (r,) a (.v,) f A (r 2 ) P (s 2 ) i/r B (r 3 ) a (,s- 3 ) (r 4 ) P (.s' 4 ) 

- f A (r 2 ) a (v 2 ) iAa (r,) P (.s',) f B (r 3 ) a (,s- 3 ) iAr (r 4 ) P (.v 4 ) H- 


John C. Slater is credited with having noticed that these terms could be written as a 
determinant (of order 4, in this case), which we construct as 

f A (r,) a (.s',) iA a (r,) (v,) ^(r,)«(.?,) f B {r i)/Hsi) 

w fA (r 2 ) a (s 2 ) f A (r 2 ) p (s 2 ) f B (r 2 ) a (s 2 ) f B (r 2 ) P (s 2 ) 

iA A (r 3 )a(s 3 ) f A (r 3 ) p (,s- 3 ) f B (r 3 )a(j 3 ) f R (r 3 ) p (,v 3 ) ^ j 

f A (r 4 ) a (,s' 4 ) f A (r 4 ) p (,s' 4 ) f R (r 4 ) a (,s' 4 ) f R (r 4 ) p (.v 4 ) 

Some authors write the determinants with rows and columns interchanged, which of course 
leaves the value of the determinant unchanged: 

f A (r,)a(.s',) f A (r 2 )«(.s' 2 ) f A (r 3 )a(j 3 ) f A (r 4 )« (,s' 4 ) 

^ (r,) P (s,) f A (r 2 ) p (s 2 ) ii A (r 3 ) p (s 2 ) f A (r 4 )P(s 2 ) 
f B (r,)a (.s',) iAb (r 2 )a (s 2 ) iAb (r 3 )a (.v 3 ) f B (r 3 )a (.s' 3 ) 
iAr (r,) (.s'i) f B (r 2 )P(s 2 ) f R (r 3 ) p (s 3 ) V^b (r 4 ) P (s 4 ) 


It is an attractive property of determinants that they change sign if we interchange two 
rows (or columns), and this is formally equivalent to interchanging the name of two of the 
electrons. Also, if two columns are the same then the determinant is zero, which is formally 
equivalent to letting two electrons occupy the same spinorbital. 

Not every electronic state of every atom or molecule can be written as a single Slater 
determinant and linear combinations are then needed. For example, of the wavefunctions 
shown in Table 14.1, we see by inspection that 'T /,, </ / 5 , & 5 and *P 6 can be written as single 
Slater determinants, for example 

to - fl I ls ( r i) 01 ( 5 i) ls ( Fl ) ^ ( J i) I to-F-\ llS ( r 0 “ Oi) 2.? (r,)a (,?,) I 

1 V 2 | Is (r 2 ) a (s 2 ) Is (r 2 ) P (s 2 ) | V 2 | Is (r 2 ) « (s 2 ) 2j(r 2 )a(s 2 )| 
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but W 2 and have to be written as a sum of two determinants, for example 

iz/=l{| ls ( r i)“( s i) 2s fa) p fa) |_| Is (r,)/3 0,) 2s fa) a fa) II 
2 2 { | Is (r 2 ) a fa) 2s (r 2 ) p fa) \ \ Is fa) P fa) 2s fa) a fa) \J 


14.8 Slater-Condon-Shortley Rules 

Slater determinants are compact summaries of all possible permutations of electrons and 
spin orbitals but the way I have written them down is unwieldy, and many authors adopt sim¬ 
plified notations. Suppose for example we have a many-electron system whose electronic 
configuration can be written 

(V'AfCV'B) 2 . . -Om ) 2 

This is chemical shorthand for 2 M spinorbitals ijr A onp' A p i// B afap.. fa M a^/ M p occupied 
by 2 M electrons. One convention is to write A for ip A a A and A for ij/ A p A with the Slater 
determinant represented 

D= \AABB...MM\ 

From time to time we need to know the expectation values of sums of certain one-electron 
operators and certain two-electron operators. Suppose that there are n electrons; these are 
indistinguishable and so any sum of operators must include all of them on an equal footing. 
Expectation values are typically the electronic contribution to the molecular electric dipole 
moment 

-e J V r,^ & dr 

and the electron repulsion in a polyatomic system with n electrons is 

1 1> has to be a linear combination of Slater determinants D l ,D 2 ,... so we need a systematic 
set of rules for working out such expectation values between single Slater determinants 
that I will call D x and D 2 which are, in the simplified notation 

A = \UVW...Z\ 

D 2 = \U'V’W' ...Z'\ 

The first step is to rearrange one of the determinants to make as many of the spinorbitals 
equal as possible. This may introduce a sign change. The algebra is easier if we assume 
that the individual orbitals are normalized and orthogonal. Consider the overlap integral 

J D x D 2 dr 

where the integration is over the space and spin coordinates of all the electrons. Each determ¬ 
inant expands into n ! terms, and the product has (nl) 2 terms. On integration, orthonormality 
of the individual spinorbitals means that there will be just n\ nonzero terms each equal to 
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1 with all remaining terms zero. If at least one of the spinorbitals are different, say U ^ U', 
then the complete overlap integral is zero. Thus 




0 otherwise 


This gives a Slater determinant normalizing factor of 1 l\fn\. 

The rules for sums of one- and two-electron operators can be found in more advanced 
texts such as Eyring et al. (1944); all we need to note are the following results. 

1. If two or more spinorbitals are zero, then the expectation value of a sum of one-electron 
operators is zero. 

2. If three or more spinorbitals are different, then the expectation value of a sum of two- 
electron operators is zero. 


14.9 Hartree Model 

We now return to the problem of the helium atom. We have established that the electronic 
wavefunction would be exactly a product of hydrogenic orbitals in the absence of electron 
repulsion. We have also seen that neglect of electron repulsion leads to impossibly poor 
agreement with experiment. The orbital model is an extremely attractive one, so the question 
is how can we allow for electron repulsion in some average way, whilst retaining the orbital 
picture? Douglas R. Hartree’s solution to the problem was to allow each electron to come 
under the influence of an average potential due to the other electron and the nucleus. 
Suppose for the sake of argument that electrons 1 and 2 occupy orbitals i/r A and i// R (which 
might be the same, but are to be determined). We could have a guess that the ^’s might be 
hydrogenic, to get the calculation started. 

According to the familiar Born interpretation, i/z^dr is a probability and so we can regard 
the second electron as a charge distribution, with density —ei/r^. Electron 1 therefore sees 
a potential due to the nucleus and the smeared-out electron 2, as shown in Figure 14.3. 



electron 1 


Figure 14.3 The Hartree model 
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The electronic Schrodinger equation for electron 1 is 



(14.25) 


That gives us i/r A . We now focus on electron 2, for which 



(14.26) 


and calculate fi B then back to electron 1 so on. The calculation is an iterative one and 
we stop once the change between iterations is sufficiently small. Each electron experi¬ 
ences a field due to the remaining electrons, and at the end of the calculation the average 
electron density derived from the field must be the same as the field and so Hartree 
(1927) coined the phrase self-consistent field (SCF for short). William Hartree and his 
son Douglas did much of the early work and so we speak of the Hartree self-consistent 
field method. 

The theory of atomic structure is dominated by angular momentum considerations, since 
the square of the orbital angular momentum operator and its z component commute both 
with each other and the electronic Hamiltonian. This simplifies the problem considerably 
and the Hartrees wrote each atomic orbital as 


fi (r) = -P(nl- r)Y lm (6,fi) 


(14.27) 


where P(nl; r) is a radial function and Y lm a spherical harmonic. Their notation for P should 
be clear; each shell has a different radial function. From now on I am going to make the 
notation more consistent with previous chapters and write 


fi{r) = -P nl {r)Y lm (6,fi) 


Thus for fluorine we would expect three different radial functions, P ls (r), P 2g (r) and P 2f (r). 
Because of the spherical symmetry of atoms, all the 2p solutions have the same radial part. 

Details of the method are given in the book The Calculation of Atomic Structures (Hartree 
1957) and essentially the radial functions are determined from the variation principle. That 
is to say, they are chosen so as to minimize the variational energy 


f d>*H<P dr 


f 0*0 dr 


14.10 Hartree-Fock Model 

Hartree’s calculations were done numerically. It soon became apparent that these early 
calculations gave energies that were in poor agreement with experiment; Fock (1930a,b) 
pointed out that Hartree had not included the Pauli principle in his method. Essentially, the 
Hartree model considered a simple orbital product such as 


tfWe = * A (ri)« (.V,) fi A (r a ) f> (s 2 ) fi B (r 3 ) a (.v 3 ) fin (r 4 ) fi (s 4 ) 


(14.28) 
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whilst the Hartree-Fock (HF) model uses a fully antisymmetrized wavefunction such as 


if a (r,) a 0 ,) 
if a (r 2 ) a (s 2 ) 
if a (r 3 ) a (s 3 ) 
if a (r 4 ) a (s 4 ) 


if a ( r i) P Csr) 

if a (r 2 ) P (s 2 ) 
if a (r 3 ) P (s 3 ) 
if a (r 4 ) P (s<) 


if B (ri) a (Si) 
if B (r 2 ) a ( s 2 ) 
if B (r 3 ) a (s 3 ) 
if B (r 4 ) a (s 4 ) 


ifB (r,)^(i,) 
ifB (r 2 ) P (s 2 ) 
ifB (r 3 ) P (s 3 ) 
ifB (r 4 ) P 0 4 ) 


(14.29) 


In the simplest version, HF theory concentrates on electronic states that can be represented 
as a single Slater determinant. We find an extra term in the energy expression called the 
exchange energy , discussed earlier in Chapter 12. Evaluation of the energy needs a more 
complicated numerical procedure then the simpler Hartree theory. Inclusion of electron 
exchange by the numerical methods used in their day proved more and more difficult for 
atoms towards the bottom right-hand comer of the Periodic Table. The Hartrees were able 
to study a wide range of atoms in different electronic states, together with their ions. To 
give a flavour of systems studied, I have reproduced in Table 14.5 the systems that Hartree 
(1957) reports in his book, which cover the period 1948 through 1957. 


Table 14.5 Selection of atoms treated in Hartree (1957) 


Atom 

Atom 

Atom 

Atom 

H“ 

Ne 2+ 

Cl 

Fe 16+ 

He+ 

Na 

Cl 

Zn 2 + 

Li+ 

Na + 

Ca 

Zr 4+ 

Be 

Mg 

Ti 2+ 

Mo + 

B 

AF 

V + 

ln 3+ 

C 

Al 3+ 

Mn 

Sb 3+ 

0 6+ 

S“ 

Mn + 

Au + 

F~ 

S 

Fe 13+ 

Tl + 




Tl 2+ 


The author mentions that all up to Mn 2+ and also Fe 16+ and Zn 2+ have 
exchange terms included in the calculations. 


14.11 Atomic Shielding Constants 

Hartree-Fock wavefunctions are the best wavefunctions that are possible within the orbital 
model, as shown in Figure 14.4. Wavefunctions A and B are simple orbital wavefunctions, 
perhaps hydrogenic or improved hydrogenic. The difference between the Hartree-Fock 
energy and experiment is called the correlation energy. 

Output from an atomic HF program consists of the radial function, together with data 
for each shell such as those shown in Table 14.6. 

It is usual to work with normalized radial functions, and this determines the functions 
apart from their sign (for if P ni (r) is a solution, so is —P„i(r)). Hartree used a convention 
that the radial function should be positive near the nucleus, whilst other authors use a 
convention that it should be positive for large r. 

Several authors pointed out that it was often desirable to have simple approximations to 
the wavefunctions and energy levels of atoms and ions. For an atom of n electrons, there are 
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- Hartree Fock 


Exact 


Figure 14.4 Hartree-Fock energy 

Table 14.6 Properties output from an atomic HF study 

e 

Orbital energy 

A 

Initial slope P n/ (r) /r ,+ \ -f~* 0 

s 

Screening parameter 

(1/R 3 ) 

Expectation value of 1/P 3 

(1 /R) 

Expectation value of 1/P 

{R) 

Expectation value of P 

(R>) 

Expectation value of P 2 

Virial ratio 

(Potential) / (Kinetic) 

Spin-orbit coupling 

Kn! 

Orbit-orbit coupling 

M k (nl, nl) 


3 n independent variables in the Schrodinger equation (plus spin). Hartree (1957) expressed 
the need most dramatically as follows: 

One way of representing a solution quantitatively would be by a table of its numerical values, 
but an example will illustrate that such a table would be far too large ever to evaluate, or to 
use if it were evaluated. Consider, for example, the tabulation of a solution for one stationary 
state of Fe. Tabulation has to be at discrete values of the variables, and 10 values of each 
variable would provide only a very coarse tabulation; but even this would require 10 78 entries 
to cover the whole field; and even though this might be reduced to, say, 5 78 = 10 53 by use of the 
symmetry properties of the solution, the whole solar system does not contain enough matter 
to print such a table. And, even if it could be printed, such a table would be far too bulky to 
use. And all this is for a single stage of ionization of a single atom. 

14.11.1 Zener’s Wavefunctions 

In Section 14.5, we addressed ways of improving the analytical Is orbital for helium, 
especially by treating the effective nuclear charge Z' as a variational parameter. In his 
keynote paper Zener (1930) extended this simple treatment to first-row atoms. As usual, 
I will let the author tell the story in his own words through the abstract. 
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The wave functions for the atoms Be, B, C, N, O, F and Ne are written as simple analytic 
expressions with several parameters. The best values of these parameters are then determ¬ 
ined by the variation method. In the final wave functions the effective quantum number 
is very nearly two, the radial node is so small as to have little effect upon the charge 
distribution, the coefficient in the exponential is related to an empirical ‘mean effective 
charge’. 


14.11.2 Slater’s Rules 

Finally we return to the work of Slater (1930), and once again you might like to read his 
abstract. 

In analogy with the method of Zener for the atoms Li to F, simple rules are set up giving 
approximate analytical atomic wavefunctions for all the atoms, in any stage of ionization. 
These are applied to X-ray levels, sizes of atoms and ions, diamagnetic susceptibility etc. In 
connection with ferromagnetism, it is shown that if this really depends on the existence of 
incomplete shells within the atoms, rather far apart in the crystal, then the metals most likely 
to show it would be Fe, Co, Ni and alloys of Mn and Cu (Fleuser alloys). 

Slater extended Zener’s shielding constants for Li to F to the other atoms by adjusting the 
values until he got agreement with experimental results of stripped atom and X-ray levels, 
atom sizes and the other quantities mentioned. He noticed that Zener’s wavefunctions had 
radial nodes but argued that they were unimportant since they come much closer to the 
nucleus than for hydrogen. Consequently he decided to ignore them altogether and wrote 
a radial part as 

(14.30) 

where n* is an effective quantum number and 5 the shielding constant. Parameters n* and 5 
are found by simple rules (that have become known as Slater’s rules) as shown in Table 14.7 
and discussed below. 




Table 14.7 Slater iT values 


For determining Z — s, the electrons are divided into the following groups, each having 
a different shielding constant: Is; 2s, 2p; 3s, 3p; 3d; 4s, 4p; 4d; 4f; 5s, 5p; etc. That is, 
the s and p of a given n are grouped together but the d and f are separated. The shells are 
considered to be arranged from inside out in the order named. 

The shielding constant s is formed, for any group of electrons, from the following 
contributions. 
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1. Nothing from any shell outside the one considered. 

2. An amount 0.35 from each other electron in the group considered (except the Is group, 
where 0.30 is used instead). 

3. If the shell considered is an s, p shell, an amount 0.85 from each electron with total 
quantum number less ny one, and an amount 1.00 from every electron still further in. 
But if the shell is a d or f, an amount 1.00 from every electron inside it. 

Slater gives three examples, reproduced in Table 14.8, which are worth quoting; C, Fe and 
Fe cation lacking a K-electron so that there is only 1 @ls electron. 

Slater’s rules are still widely quoted in atomic and molecular structure theory. 


Table 14.8 Slater Z - s values 



C 

Fe 

Fe+ (Is 1 ) 

Is 

5.70 = 6 — 0.30 

25.70 = 26- 0.30 

26.00 

2s, 2p 

3.25 = 6-3(0.35)-2(0.85) 

21.85 = 26- 7(0.35)-2(0.85) 

22.70 

3s, 3p 


14.75 = 26- 7(0.35)- 8(0.85) 
-2(1.00) 

15.75 

3d 


6.25 = 26 - 5(0.35) - 18(1.00) 

7.25 

4s 


3.75 = 26- 1 (0.35) - 14(0.85) 
-18(0.85)- 10(1.00) 

4.75 


14.12 Koopmans’ Theorem 

Consider the ionization process 

Ne(ls 2 2s 2 2p 6 ) ->• Ne + (ls 2 2s 1 2p 6 ) 

where I have ionized a 2s electron from neon (Figure 14.5). Suppose that the energy level 
diagram represents a HF calculation on neon before ionization, and that the orbitals do not 
relax in any way after ionization. That is, the neutral atom and the ionized cation have the 
same HF orbitals. According to Koopmans’ theorem, the ionization energy for the process 
is the negative of the HF orbital energy. The theorem holds for all HF orbitals. 


Koopmans’ Theorem 
Ionization from HF orbital if 
Ionization energy = —orbital energy 


Ionization energies can be measured accurately by modern-day versions of the photo¬ 
electric effect. For valence shell ionization we use ultraviolet photoelectron spectroscopy 
(UVPES) and, as the name suggests, ultraviolet photons are the ionizing source. Inner shell 
electrons need X-rays for ionization. 
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Figure 14.5 Ionized neon 
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15 

Simple Molecules 


It is time to progress to the quantum theory of molecules, and it should come as no surprise 
when I tell you that the orbital model is a good starting point for many of the calculations 
we professionals do today. 

Molecules are more complicated than atoms for the following reasons. 

• Whilst angular momentum considerations dominate the theory of atomic structure (it is 
amazing what can be done with pencil and paper before any nasty integrals have to be 
evaluated) the L 2 and L z operators do not commute with molecular Hamiltonians (except 
for linear molecules; if the molecular axis is the z axis, then L z does commute). To a 
certain extent, molecular symmetry operators help us since they commute with molecular 
Hamiltonians, but most molecules of any real chemical interest have no symmetry apart 
from the identity operation. So we are stuck with the Hamiltonian, and nothing to help 
us simplify the eigenvalue problem apart from electron spin. The Schrodinger equation 
for a free molecule in the absence of an applied field does not contain spin and so both 
S 2 and S z inevitably commute with molecular Hamiltonians. 

• Molecules are not spherical, and the great simplifying feature of atomic HF theory 

f{r)= l -P nl (r ) Y l mi (9 , <p) 

is no longer appropriate. This statement follows from the one above. 

• The Hartree-Fock (HF) limit is attainable for atoms by direct numerical integration 
of the HF equations. This limit is unattainable for molecules except in a few simple 
cases of high symmetry, and numerical integration techniques that are fine for atoms are 
inapplicable for molecules. In any case, we have to concern ourselves with calculations 
that are beyond the HF limit if we want to study chemical reactions. 
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15.1 Hydrogen Molecule Ion, H 2 + 

The simplest molecule is the hydrogen molecule ion, H 2 + , shown in Figure 15.1. It is 
formed by passing an electric discharge through dihydrogen, and it has been well studied 
experimentally. Its dissociation energy is known to be D t — 269.6 kJ mol -1 and it has an 
equilibrium bond length of 106 pm. There is only one well-established electronic state, 
namely the ground state. 



First of all, in quantum mechanics just as in classical mechanics, we can rigorously 
separate off the translational motion of the molecule. That leaves us to concentrate on 
the two nuclei (each of mass m p = 1.673 x 10 _27 kg) and the electron (of mass m c = 
9.109 x 10 -31 kg) about the centre of mass. The wavefunction therefore depends on the 
coordinates of the electron (r) and the two nuclei (R A and R B ): 

^ tot - <P lnl (R a , R b , r) 

If we regard such a system from the viewpoint of classical mechanics, we would be 
tempted to try and separate the motions of the nuclei and the electron, because of the great 
difference in their masses (a factor of 1:1836). This does not mean that the motions are 
truly separable like the molecular translational motion of the centre of mass and the relative 
motion of the nuclei and electrons within the molecule, more that the separation can be 
done with only a small error. 

Born and Oppenheimer (1927) first investigated this possibility and wrote 
i/V (R a , R b , r) = <P nuc (R a , R b ) f e (R a , R b , r) 


I am using a convention that lower case refers to one electron, whilst upper case 'P refers 
to many electrons. In this case there is only one electron and so the electronic wavefunction 
is technically an orbital ; in this case a molecular orbital (MO). Born and Oppenheimer 
showed that the approximation was good to almost (but not exactly) the ratio of the particle 
masses and so we normally glue the nuclei to fixed positions in space and concentrate on 
the electron(s) when investigating many problems in molecular electronic structure theory. 
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We treat the nuclei separately; if we want to know about vibrational and rotational motion 
we have to solve the relevant nuclear vibrational Schrodinger equation and the relevant 
nuclear rotational Schrodinger equation. This is not the same as doing a MM calculation 
for the nuclei: there are proper quantum mechanical vibrational and rotational Schrodinger 
equations as we have seen in Chapter 11. 

The separation of electronic and nuclear motions allows us to specify a molecular geo¬ 
metry; the electrons experience an electrostatic potential energy due to the nuclei, which 
are fixed at positions in space for the purpose of calculating the electronic wavefunction. 
Just as for MM, molecular geometries can be found by investigating stationary points on 
the potential energy surface. 

The electronic wavefunction is given as the solution of an electronic Schrodinger equation 

-sfc(£ + £ + £)+ u *• ' R ‘- R - r >- (R *’ R - r > 


where the electrostatic potential energy is 

4TO,Vlr-R,l r — R B | / 


We solve the electronic equation and the total energy is given by adding on the fixed nuclear 
repulsion 

£ - = £ - + 4, £0 l R t-R B | (15 ' 2) 

The hydrogen molecule ion is unique amongst molecules in that we can solve the elec¬ 
tronic Schrodinger equation exactly (by numerical methods) to any required accuracy, 
within the Born-Oppenheimer approximation. The first step is to make a change of variable 
to so-called elliptic coordinates that are defined with reference to Figure 15.2: 

r A + r B r A -r B 

In this system of coordinates the electronic Schrodinger equation is 

a 2 


87 x 2 r, 


b (M 2 ■ 


1 if e + U e = £„1 fa 


+_^_ 

{p? - 1) (1 - V 2 ) 

and we look for a separation of variables solution 

Since 0 only enters the equation in the d 2 /d<p 2 term, it is at once apparent that we can factor 
off the 0 term. If we call the separation parameter —A. 2 we have 


and so 


(15.4) 
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Figure 15.2 Elliptic coordinates 


where X can take on positive and negative values, and j is the square root of — 1. In the 
limit as R A n —► 0, the quantum number X becomes equivalent to the atomic quantum 
number m h 

Separation into a p and a v equation also proves possible, and the equations have 
been solved by Teller (1930), by Burrau (1927) and by others leading to results in com¬ 
plete agreement with experiment. Solution of the differential equations is far from easy 
and the best references are Bates et al. (1953), Wind (1965) and of course Eyring et al. 
(1944). 


15.2 LCAO Model 

The hydrogen molecule ion is unusual amongst molecules in that we can solve the electronic 
problem exactly (by numerical methods). Once we consider polyelectron systems, we have 
to seek approximate methods. Any chemist would argue that molecules are built from atoms 
and so we should capitalize on this chemical knowledge by attempting to build molecular 
wavefunctions from atomic ones. 

Suppose we build the hydrogen molecular ion starting from a hydrogen atom and a proton 
initially separated by a large distance. The electronic wavefunction will be a hydrogen Is 
orbital until the proton is close enough to make any significant perturbation and so we 
might guess that the molecular wavefunction should resemble an atomic Is orbital, at least 
near the appropriate nucleus. 

We therefore guess that the low-energy molecular orbitals of H 2 + might be represented as 
$ = C A ls A + C B ls B 

where the coefficients c A and c B have to be determined. This technique is called the linear 
combination of atomic orbitals (LCAO) and I have used the shorthand that 1 s A is a hydrogen 
Is orbital centred on nucleus A. In this particular case we can deduce the coefficients from 
symmetry. According to the Born interpretation, i// 2 dr gives the chance that an electron 
can be found in the volume element dr. We have 


ifr 2 dr = (c^Isa + 2c A c B ls A ls B + CglSg) dr 



Simple Molecules 


241 


Electron densities around the two hydrogen atoms have to be the same by symmetry, and 
the nuclear labels A and B can be used interchangeably, which means that 

C A = C B 

c A = ±c B 


This gives two possibilities, which I will label t/r + and V / -> and we usually write them in 
normalized form as 


V2(l+S) 


l/r_ = , (ls A — ls B ) 

* V2(l -S) K 

where S— J ls A ls B dr 

= e x p(=^)(, + ^- 
\ a 0 ) \ a 0 


(15.5) 


We can test these approximate wavefunctions by calculating the variational energies, 
which can be written 


4tt£ 0 Rab 1 ± S 
iir 


■f-f hid 

4 tcs 0 J r A 

- (l + —) eiP (-2—)) 
47re 0 a 0 ^AB \ \ a 0 J \ a 0 )) 

4 

,J' . ( ' + '“ I “P (- ) 


(15.6) 


The analytical solution of these integrals together with the overlap integral S (in Equa¬ 
tion (15.5)) is covered by the classic texts such as Eyring etal. (1944); I have just quoted the 
results. This gives us the two potential energy curves shown in many elementary quantum 
chemistry textbooks (Figure 15.3). There is a subtle point: they are often called molecular 
potential energy curves because the nuclei experience just this potential. 

The ground-state energy of a hydrogen atom is and the curves tend asymptot¬ 

ically to the correct limit. The upper curve describes an excited state whilst the lower 
curve describes the ground state. The calculated binding energy is in poor agreement with 
experiment (Table 15.1), whilst the equilibrium bond length is in modest agreement with 
experiment. 

A hydrogenic Is orbital has the form 
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Distance/atomic unit 

Figure 15.3 Potential energy curves for lowest states of H 2 + 


Table 15.1 Hydrogen molecule ion results 



D e /eV 

RJpm 

Experiment 

2.791 

106 

Simple LCAO with £ = 1 

1.76 

132.3 

Simple LCAO, best £ = 1.238 

2.25 

106 

James, elliptic 

2.772 

106 


where the orbital exponent £ = 1 for a hydrogen atom, 2 for He + and so on. The next step 
is to find the best value of £ that is appropriate for a hydrogen atom within a molecule. The 
best value turns out to be 1.238 showing that the Is orbital contracts a little on molecule 
formation. Table 15.1 shows better agreement with experiment. 


15.3 Elliptic Orbitals 

The best orbitals that have been obtained for H 2 + were found by a different approach; the 
natural coordinates to use are the elliptic coordinates /jl, v, <f> and James (1935) found that 
a good approximation to the lowest orbital is 

t = exp(-<5/r) (1 + cv 2 ) 


His results are shown in Table 15.1. 


(15.8) 
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The hydrogen molecular ion is interesting in that it does not have an electron pair yet it 
is stable. The obvious question is, can we give a hand-waving explanation for this stability, 
one that does not rely on the presence of electron pairs? 

The obvious place to look is the electron density. The electron density plots by themselves 
are not particularly informative; the ground-state wavefunction shows a region of appar¬ 
ently enhanced electron density between the nuclei, whilst the excited-state wavefunction 
has less electron density in this region. It is more interesting to look at the difference between 
the electron density in the molecule and two ground-state hydrogen atoms, each of which 
contains half an electron. Such plots are called density differences. Positive contours corres¬ 
pond to a gain of electron density (in units of e) compared to two overlapped half hydrogens; 
negative contours correspond to a loss. The molecule lies along the horizontal axis. 

The density difference plot for the ground state (Figure 15.4) illustrates an electron 
density enhancement in the region between the nuclei and so a stable molecule. Electron 
density has to be redistributed, it cannot appear from nowhere and we see that the bond 
region gains at the expense of regions beyond the nuclei. The excited-state wavefunction 
shows the opposite effect: there is a depletion of electron density in the bond region, and 
a gain of electron density beyond the nuclei (Figure 15.5). This explains the instability of 
this state, so we can give a simple electrostatic explanation for the stability of the molecule 
in its ground state, and its instability in the excited state. 

It is a general principle that, once we have calculated an electron density from the laws 
of quantum mechanics, we can analyse it by the laws of classical electrostatics. 



0-1-1-1-1-1- 

0 10 20 30 40 


Figure 15.4 Density difference for ground state 
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Figure 15.5 Density difference for excited state 


15.4 Heitler-London Treatment of Dihydrogen 

The simplest molecule of any true chemical importance is dihydrogen. The methods dis¬ 
cussed above for the hydrogen molecule ion are not applicable to dihydrogen, because 
of the extra electron and the electron-electron repulsion. The first successful treatment of 
dihydrogen was that of Heitler and London (1927). They argued as follows. Consider a 
‘reaction’ 

H a + H b -* H 2 

where the left-hand-side hydrogen atoms labelled A and B are initially at infinite separation. 
Each hydrogen atom is in its lowest electronic state and there are two spin orbitals per atom 
giving four in total: ls A a, 1 s A /f, ls B a and 1 s B /S. As we bring the atoms closer and closer 
together, eventually their orbitals will overlap significantly and we have to take account 
of the Pauli principle. We need to make sure that the total wavefunction is antisymmetric 
to exchange of electron names. There are (4!/2!) = 12 possible ways of distributing two 
electrons amongst four atomic spin orbitals but not all such distributions satisfy the Pauli 
principle. Analysis along the lines given for helium in Chapter 14 gives the following 
unnormalized building blocks (Slater determinants): 


ls A (ri) a CsO 

L’b (r,) p (s t ) 

\ D 2 = 

Ha (D/H*,) 

L’b (r,) a (v, ) 

ls A (r 2 ) ce (s 2 ) 

ls B (r 2 ) P (s 2 ) 

l.v A (r 2 ) P (s 2 ) 

1^ B (r 2 ) a (s 2 ) 

1 ls A (r 2 ) a (it) 

Isb ( r 0 a (s{) 1 

£>4 = 

1 s A ( Fl ) j9 (sO 

1 Sb fa) )8 (*) 1 

ls A (r 2 )a( ft ) 

l-s B (r 2 ) a (s 2 ) 

Ha (r 2 )/3fe) 

ls B (r 2 )^fe) 
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I h’ A (ri)a 00 lj’A (r,)/?(*,) I I h B (r,)a (s,) L B (r,)£ (s,) I 
5 | Ls A (r 2 ) a fe>) ls A (r 2 ) P fe) | 6 j ls B (r 2 ) a (s 2 ) ls B (r 2 )P(s 2 ) | 

It is useful to combine the D 's into spin eigenfunctions, since the spin operators commute 
with the molecular Hamiltonian. The advantage is that we only need take combinations 
of those wavefunctions having the same spinquantum numbers when seeking to improve 
our description of the electronic states (Table 15.2). The combination ^ — D x — D 2 is a 
singlet spin state and is said to represent the covalent bond, since it gives an equal sharing 
to the two equivalent Is orbitals by the two electrons. <^ 2 through correspond to the 
first excited state, which is a triplet spin state. They have the same energy in the absence 
of an external magnetic field. and <^ 6 are called ionic terms, because they represent an 
electron density distribution in which both electrons are associated with the same nucleus. 
Heitler and London included tf r 1 (for the electronic ground state) and <^ 2 through (for 
the excited triplet state) in their original calculation. The necessary integrals needed for a 
variational calculation are given in Heitler and London’s paper, and in the paper by Sugiura 
(1937). We often refer to the Heitler-London approach as the valence bond (VB) method 
and I will use the two descriptors interchangeably. It was the first successful treatment of 
an electron pair bond. 


Table 15.2 Dihydrogen elementary valence bond (Heitler-London) 
calculation 


<p 

Combination 

5 

M s 

Comment 


D, - D 2 

0 

0 

Covalent ground state 

4> 2 

D, + D 2 

1 

0 

Excited triplet state 

<I > 2 

d 3 

1 

1 

Excited triplet state 


d 4 

1 

-1 

Excited triplet state 


D s 

0 

0 

Ionic term 


d 6 

0 

0 

Ionic term 


The energy corresponding to 'I'i is sometimes written 
_ J + K 
£ ~ 1 + S 2 

J = I ls A (r 2 ) ls B (Ti)HIs a (r,) ls B (r 2 ) dzidr 2 

J r „ (15.9) 

K= I l.s A (r,) l.v B (r 2 )H]s A (r 2 ) l.v B (r,)dT,dr 2 

S = j LufrOLv,, (r,)dr 

and early ‘explanations’ of chemical bonding focused on the Coulomb {— J /1 + S 2 ) and 
exchange (= K/\ + S 2 ) contributions to molecular energies. Several improvements were 
made to the simple VB treatment of dihydrogen, for example treating the orbital exponent 
as a variational parameter, and inclusion of the ionic terms once again correctly weighted 
by use of the variation principle. This latter procedure is referred to as configuration 
interaction (Cl). 
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15.5 Dihydrogen MO Treatment 


The MO treatment was given by Hellmann (1937) amongst others. He took the lowest 
energy MO as a combination of hydrogen Is orbitals, as for H 2 + , 


and wrote 


V2 (1 + S) 


(\s A + 1.? B ) 


_ /T| yf+(ri) a (so \lr+ (T!) P (Si) I 

7 V 2! | ir+ (r 2 ) a fe) <A+ (r 2 ) p (s 2 ) \ 


The calculated bond length and dissociation energy are in poorer agreement with exper¬ 
iment than those obtained from the simple VB treatment (Table 15.3), and this puzzled 
many people at the time. It also led them to believe that the VB method was the correct 
way forward for the description of molecular structure; in the event, advances in com¬ 
puter technology and numerical linear algebra have proved them wrong, but that is for a 
later chapter. 


Table 15.3 Dihydrogen calculations 


Comment 

D e /eV 

R e /a 0 

Experiment 

4.72 

1.40 

Simple VB 

3.14 

1.64 

Simple MO 

2.65 

1.60 

James and Coolidge 

4.698 

1.40 


But why is the simple MO treatment of dihydrogen so poor? If we expand D 1 in terms 
of the atomic orbitals ls A and ls B we find (apart from the normalizing constant) 

£>7 = (ls A (r,) fs B (r 2 ) + b' B (r,) \s A (r 2 ) + \s A (r,) ls A (r 2 ) + l.s B (r,) b B (r 2 )) 
x (a (si) p (s 2 ) - a (s 2 ) P (sO) 

which is equal to the simple VB wavefunction but with ionic terms included and weighted 
equally to the covalent terms. The solution to the problem is to include excited states in 
the wavefunction, just as we did for helium, but with a variable weight. This process is 
configuration interaction (Cl). In the limit, once all the refinements are made, the two refined 
treatments (VB with Cl and MO with Cl) give exactly the same results and so there is no 
particular reason to prefer one rather than the other. The starting points are different, but 
not the refined end results. 

To summarize, Figure 15.6 shows the potential energy curves for the VB, LCAO and Cl 
treatments. The full curve is the VB calculation, the dotted curve the MO and the dashed 
curve is a Cl treatment. 
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Figure 15.6 Simple dihydrogen calculations 


15.6 James and Coolidge Treatment 


Just as for H 2 + , it is again found that an accurate value of the binding energy can be obtained 
by writing the wavefunction in terms of elliptic coordinates. James and Coolidge (1933) 
wrote such a wavefunction that included the interelectron distance r 12 explicitly: 


f = exp (-8 {iM + 111)) c Umnp (p![p} 2 v™v n 2 u p + /4/z*v"v> p ) 

(15.10) 


r a, i + r B ,i r A j — r B>1 2r 12 



where k, l, m, n and p are integers and the form of the function is that it is symmetric to 
interchange of electron names. Parameter 8 is the orbital exponent. In order to make the 
wavefunction symmetric in the nuclear coordinates, the authors included only those terms 
having (m + n) as an even integer. They found that a 13-term function gave essentially 
complete agreement with experiment. 


15.7 Population Analysis 


Our simple treatment of the hydrogen molecule ion was based on the bonding orbital 


which corresponds to a charge distribution of density — e\jj- + 2 . We normally use the symbol p 
for volume charge densities and to emphasize that it depends on positions in space we write 


p(r) = -e«r + (r )) 2 
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Substituting and expanding we have 

P M = ~ 2( ^ +S) (( 1s a( r )) 2 + (ls B (r)) 2 + 2 x ls A (r) x ls B (r)) 


which we can write formally in matrix language as 


P (r) = —e (ls A (r) 


ls B W) 


/_i_1_\ 

2 (1 + S') 2 (1 + S) 

1 1 
\2 (1 + S) 2(1 + 5)/ 



(15.11) 


Chemists often forget about the — e since they tend to think in positive numbers of 
electrons, and they write the middle symmetric matrix P. It is referred to as the charge 
density matrix, or the matrix of charges and bond orders. In this case we have 


/ 1 1 \ 

2(1+ S) 2(1+5) 

1 1 
\2(1+S) 2(1 + 5)/ 


From the early days of molecular quantum theory, authors have tried to divide up the 
electron density in chemically appealing ways, by allocating parts of the density to atom 
regions and parts to bond regions. If we integrate the electron density 


h 


K/< 


(ls A (r)) 2 dr + / (ls B (r)) 2 dr + 2 / ls A (r) x ls B (r) dr 


which must come to the number of electrons (1 in the case of hydrogen molecule ion) 
times —e. We therefore have, on doing the integrals and keeping the terms in order 


/ P (r) dr = - 


-(1 + 1+25) 


2(1 + 5) 

We interpret this by saying that the electron charge fractions 

-2 Se 


2(1 + 5)’ 2(1 + 5) 


and 


2(1 + 5) 


are associated with atom A, with atom B and with the bond region between atoms A and B. 
These fractions are referred to as the net atom and bond (or overlap) populations. Mulliken 
(1955) introduced the idea of population analysis, but the ideas had already been around 
for a very long time. He had the idea to divide each overlap population into equal parts 
and allocate part to each atom formally contributing to the bond. Here is the abstract of his 
famous 1955 paper: 


With increasing availability of good all-electron LCAO-MO wavefunctions for molecules, a 
systematic procedure for obtaining maximum insight from such data has become desirable. An 
analysis in quantitative form is given here in terms of breakdowns of the electronic population 
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into partial and total ‘gross atomic populations’ or into partial and total ‘net atomic popula¬ 
tions’ together with ‘overlap populations’. ‘Gross atomic populations’ distribute the electrons 
almost perfectly among the various AOs of the various atoms in the molecule. From these 
numbers, a definite figure is obtained for the amount of promotion (e.g., from 2s to 2p) in each 
atom; and also for the gross charge Q on each atom if the bonds are polar. The total overlap 
population for any pair of atoms in a molecule is in general made up of positive and negative 
contributions. 

Mulliken’s aim was to give rough-and-ready indices that characterize a molecular charge 
distribution and that can be used for comparisons between molecules. The Mulliken gross 
atom populations in our case are 


1 


S 


2(1 + 5) + 2(1+ S) 2 


15.7.1 Extension to Many-Electron Systems 

Amore careful analysis is needed when dealing with many-electron wavefunctions. Suppose 
& is an m-electron wavefunction; then it will depend on spatial coordinates and spin 
coordinates 


W = V (it, su r 2 , +,..., r m s m ) 


If I use the convention that dr is a spatial differential volume element (equal to drdydz in 
Cartesian coordinates) and dv refers to the spin variable, then the Bom interpretation is that 


I'P (r 1 ,s 1) r 2 ,S 2 , ... ,r m 5 m )| 2 drid5idr 2 d52 • ■ ■ dr m d s„ 


gives the probability of finding simultaneously electron 1 in dtidsT, electron 2 in dr 2 d.v 2 ,... 
electron m in dr,„d.v,„. Many simple physical properties such as the electric dipole moment 
depend only on the occupancy of an arbitrary space-spin differential element drd.s by any 
electron. We can find this by averaging over all electrons except electron number 1. Indeed, 
the electric dipole moment does not depend on electron spin and so we can also average 
over the electron spin. I can illustrate with reference to dihydrogen LCAO 



1A+ Oh) a Oh) V+ Or) P Oi) 
f+ (r 2 ) a fo) f+ (r 2 ) P (s 2 ) 



If we average over electron 2 (i.e. integrate with respect to its space and spin coordinates) 


we get 



and if we are not particularly interested in the spin variable we find, on integration over the 
spin of electron 1 



(15.12) 
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I have of course assumed that all the wavefunctions are real rather than complex; it would 
not have made any difference to my argument. 

This latter quantity Equation (15.12) times dri gives the probability of finding electron 
1 in dri with either spin, and the other electrons anywhere and again with either spin. Since 
there are two indistinguishable electrons in dihydrogen, the total electron density must be 
twice my result: 

P (r) = —2e{\j/ + (r)} 2 


The charges and bond orders matrix is therefore just twice what we 
molecular ion: 

/I 1 \ 

found for the hydrogen 

" B? 

% s 

(15.13) 

V 1 + 5 Wt+-S/ 

A corresponding analysis for the VB function gives 

/ 1 S \ 




(15.14) 

V 1 + S 2 

1 + s 2 / 



and the gross Mulliken population for each hydrogen nucleus is once again 1/2, just as it 
should be. 

There is a nice way to remember the result, provided you are happy with matrices. If we 
write an overlap matrix S for the two atomic orbitals ls A and ls B as 


S = 


ls A ls A dr 

ls B ls A dr 


1 


ls A ls B dr 

ls B ls B dr 


then a little analysis shows that 


Y. y, PyS,-, = number of electrons 


This can be restated in terms of the trace of the matrix product 


Tr (PS) = number of electrons (15.15) 
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16 

The HF-LCAO Model 


In this chapter I want to explain how we progress the Hartree-Fock (HF) orbital model to 
more complicated molecules. The HF model encompasses the Born-Oppenheimer approx¬ 
imation and so we glue the nuclei into place before trying to calculate the electronic 
wavefunction. Each electron experiences an average field due to the remaining electrons, 
together with the field due to the fixed nuclei. If we are interested in geometry optimization, 
we have to perform HF calculations at each point on the molecular potential energy surface. 

We can solve the relevant HF equation numerically for atoms to whatever accuracy we 
require, because of their high symmetry. In the case of a molecule, we have to resort to the 
LCAO approximation as discussed in Chapter 15. 

In the case of dihydrogen and the hydrogen molecule ion, we were able to deduce simple 
forms for the lowest energy molecular orbitals 


= V2(1±S) ( 


just on the basis of symmetry considerations, assuming that only the hydrogenic Is orbitals 
had a role to play. We have to use very many atomic orbitals (basis functions, to use 
the correct jargon) for polyatomic molecules and it is not possible to deduce the LCAO 
coefficients by hand-waving symmetry considerations. The ‘shape’ of the resulting orbitals, 
by which I mean the LCAO coefficients, is determined by the variation principle. We 
therefore have to find a variational expression for the electronic energy. 

There is a third issue, namely which basis functions should we use for a polyatomic 
molecule; we have already met hydrogenic and Slater orbitals, but it turns out that neither of 
these is suitable by themselves for molecular calculations. The reason is one of pragmatism, 
and has to do with the difficulty of calculating certain two-electron integrals that appear in 
our orbital model. More of this later in the chapter. 
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16.1 Roothaan’s 1951 Landmark Paper 

Our next landmark is the paper by Roothaan (1951). In those days papers did not always 
have a synopsis. We can learn a great deal from the first paragraph of the introduction, as 
follows. 

For dealing with the problems of molecular quantum mechanics, two methods of approx¬ 
imation have been developed which are capable of handling many-electron systems. The 
Heitler-London-Pauling-Slater or valence bond (VB) method originated from a chemical 
point of view. The atoms are considered as the material from which the molecule is built; 
accordingly, the molecular wave function is constructed from the wave functions of the indi¬ 
vidual atoms. The Hund-Mulliken or molecular orbital (MO) method is an extension of the 
Bohr theory of electron configurations from atoms to molecules. Each electron is assigned to 
a one-electron wave function or molecular orbital, which is the quantum mechanical analog 
of an electron orbit . . . 

It is the purpose of this paper to build a rigorous mathematical framework for the MO 
method. 

Within the Born-Oppenheimer approximation we consider a molecule as a set of N point 
charges of magnitudes eZ,, eZ 2 , . . . , eZ N at fixed position vectors Ri, R 2 , . . . , R ;V . Their 
mutual potential energy is 



(16.1) 


Assume for the moment that we have chosen n basis functions, which could be the 
Slater orbitals from Chapter 14. Basis functions are usually real quantities in the math¬ 
ematical sense, but complex basis functions have to be used in difficult cases such as 
when we have to deal with molecular magnetic properties. I will assume that we have 
chosen a set of real basis functions written Xi(r), ^(r), ■ ■ . , Xn( r )- The HF-LCAO method 
seeks to express each orbital i/q (r), i/r 2 (r), . . . , ijr M { r) as a linear combination of the basis 
functions 


ft = Ci,iXi + c&X2 + ■ • • + c iA )(„ 


(16.2) 


and the process gives n LCAO-MO orbitals in total. 

Roothaan’s original treatment only applies to molecular electronic states where each 
HF-LCAO is doubly occupied (so-called ‘closed shell states’). This covers the case of the 
vast majority of organic molecules in their electronic ground state, and we think of the 
electronic configuration as 


if if if2 f ■ ■ ■ (tuf 


as shown in Figure 16.1. Also, the treatment is restricted to the lowest energy state of each 
allowed symmetry type; since most organic molecules have no symmetry, this means that 
only the electronic ground state can be treated. The HF-LCAO wavefunction is a single 
Slater determinant, and we can assume without any loss of generality that the HF-LCAO 
orbitals are normalized and orthogonal. Each orbital is doubly occupied and there are 
therefore m = 2M electrons. 
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Figure 16.1 Electronic closed shell with 2M electrons 


We now need to find the variational energy, which for a real wavefunction is 


_ /1 l/HV dr 
£el_ f'T' 2 dr 


It is neatest if I express the Hamiltonian as a sum of one-electron and two-electron operators 
as in Chapter 14: 


» = E*»w+EDM 


(16.3) 


I also discussed the Slater-Condon-Shortley rules in Chapter 14. Application of these rules 
to our wavefunction gives 




The one-electron term represents the kinetic energy of the 2 M electrons and the mutual 
electrostatic potential energy of the electrons and the nuclei. The first of the two-electron 
terms is the Coulomb term; it represents the mutual electrostatic repulsion of a pair of 
electrons whose charge densities are —eijr^ and — The second of the two-electron 
terms is the exchange term, which arises because of the fermion nature of the electrons. 
These equations apply for any single determinant closed shell; they do not yet contain any 
reference to the LCAO procedure. 
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16.2 The J and K Operators 


It is sometimes useful to recast the equation as the expectation value of a sum of one-electron 
and certain pseudo one-electron operators 

s e = 2 ( r i) ^ (1) ( r i)+ J ( r i)- ( r i)) ft ( r i) dr i ( 16 - 5 ) 

The operator h m represents the kinetic energy of an electron and the nuclear attraction. The 
operators J and K are called the Coulomb and the exchange operators. They can be defined 
through their expectation values as follows: 


fa 


(r i)J (r,) x/r R (r,)dr, 


-i ifa 


(*i)g ( r i, r 2 ) irf (r 2 )dr,dr 2 


J f R (r i)K (r,) f R (r,) dr 2 = JJ f R (r,) f t (r,)g (r ls r 2 ) f R (r 2 ) (r 2 ) dr,dr 2 


The HF Hamiltonian is a one-electron operator, defined by 
h F (r) = h m (r)+J(r)- ] -k(r) 


(16.7) 

(16.8) 


where the coordinates r refer to an arbitrary electron. HF orbitals are solutions of the 
eigenvalue equation 

h ¥ (r) if (r) = ef (r) 


16.3 HF-LCAO Equations 


I am going to make use of matrices and matrix algebra for many of the derived equations, 
for the simple reason that they look neater than they would otherwise do. 

The n basis functions are usually real and usually overlap each other and so they are 
not necessarily orthogonal. Following the arguments of Chapters 13 and 14, we collect 
together the basis function overlap integrals into an nxn real symmetric matrix S that has 
typically an i,j element 

Sij = J Xt (r.) Xi (ri) dr, 

It is convenient to store the LCAO coefficients in an n x n matrix 


1 Cl.l C 2 ,i . . . C„,i ^ 

Cl,2 C 2>2 . . . C„, 2 


V C U c 2 ,„ ... c n „ / 


(16.9) 


so that the first column collects the coefficient of the first occupied HF-LCAO orbital and 
so on. If we collect together the m occupied LCAO orbitals into an n x m matrix U occ 
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( Cl,l C 2,l 
C l,2 C 2,2 


(16.10) 


then the matrix 2U occ (U occ ) T gives the charges and bond orders matrix P discussed earlier. 
The next step is to express the energy in terms of the basis functions and the matrix P. 
The one-electron contribution is 


2 E /^R (ri)£ (1) (J-i) V/r (ri) dr = 2 £ E E J * (r ^ h< " (r ^ % (ri) dr 


If we switch the summation signs on the right-hand side we recognize elements of the 
charges and bond orders matrix P 

2 Cr -‘ Cr J J Xi ( r '^ < ' ) ( r ri Xi ( r 0 dr = X) | 2 c R jC R j JX< (ri )h m (ri) Xj (ri) dr j 

= X) |p« J Xi (ri )h m (ri) Xi (ri) dr j 

Finally, on collecting together the one-electron integrals over the basis functions into an 
n x n matrix h, whose 1,7th element is 

(hi);; = J Xi (ri)/i (1) (ri) Xi (ri) dr, 

then the one-electron term emerges as the trace of the matrix product Pl^ 


rtPHO 


(16.11) 


A corresponding analysis shows that the two-electron terms can be written as 




where the elements of the matrices J and K depend on those of P in a more complicated 
way: 


J ij = J2J2 P ki JJ Xi (ri )Xi (ri) 8 (ri.ri) Xk (ri) Xi (ri) dr,dr 2 
= p ki JJ Xi (ri)Xr (ri) g (ri, ri) Xj (ri) Xi (ri) dr,dr 2 


(16.12) 


Many authors collect together these Coulomb and exchange matrices into a composite 
called the electron repulsion matrix : 


G = J - -K 
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The electronic energy therefore comes out ai 


(16.13) 

= Tr (Ph,) + ^Tr (PG) 

We now examine how the electronic energy changes when P changes by a small amount 
<5P (equivalent to asking how e el changes as the LCAO coefficients change). We let 

P^P+8P (16.14) 

and after a little manipulation find the first-order change in the electronic energy 
8e el = Tr (h,8P) + Tr (GSP) 

= Tr ( hF8P ) (16.15) 

where I have defined the Hartree-Fock Hamiltonian matrix 

h F = hj + G (16.16) 


We want to find <5P so that <5e d is zero; the trivial solution is when <5P equals the zero 
matrix, but there obviously have to be restrictions on the allowed form of <5P. I noted above 
the requirement that the HF-LCAO orbitals were normalized and orthogonal. If we collect 
together all the overlap integrals over basis functions into the matrix S then a little matrix 
manipulation will establish that 

U r SU = 1 


PSP = 4P 


The condition that the HF-LCAO orbitals are normalized and orthogonal is equivalent to 
the matrix equation PSP= 4P and the modified P must also satisfy this condition 

(P + <5P) S (P + <5P) = 4 (P + <5P) 

A little manipulation shows that, at the energy minimum 

h F P = Ph F (16.17) 

and whilst this does not actually help find the electron density at the minimum, it gives a 
condition that has to be satisfied. 

Roothaan solved the problem in a different but equivalent way; he let the HF-LCAO coef¬ 
ficients vary subject only to the condition that the HF-LCAO orbitals remained normalized 
and orthogonal. He demonstrated that the coefficients c, are given by the generalized matrix 
eigenvalue problem 

h F C, = £;SC; (16.18) 

The HF matrix isnxn and there are exactly n solutions to the eigenvalue problem. The 
lowest energy m solutions correspond to the occupied HF-LCAO orbitals. The energy e is 
called the HF orbital energy, and for each value of s there is a column vector of HF-LCAO 
coefficients. Once again, this does not help us to find the coefficients, because they are 
contained within h F so that an iterative procedure is necessary. 
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16.3.1 The HF-LCAO Equations 

For the record, since I will need to refer to the HF Hamiltonian many times, here they are 
(for the closed shell system as shown in Figure 16.1, and assuming real basis functions): 




(16.19) 



1 

2 


Over the years, many workers have devised iterative procedures for solving the problem. 
The simplest procedure is as follows. 

• Choose a molecular geometry. 

• Choose a suitable basis set. 

• Calculate all integrals over the basis functions and store them. 

• Make an educated guess at the HF-LCAO coefficients U r>cc . 

• Calculate P and h F (the time-consuming step). 

• Solve the matrix eigenvalue problem to give the new U occ . 

• Check for convergence (test e d and/or P). 

• Exit or go back three steps. 

There are other procedures. I will give you a numerical example later, and explain some 
of the methods that people use in order to speed up their calculation. Naturally, having 
progressed this far in the text, you will know that the HF-LCAO calculation simply gives 
one point on a molecular potential energy surface, as defined by the Born-Oppenheimer 
approximation. If your interest in life is molecular geometry optimization then you will 
have to follow the same kind of procedures as with MM in Chapter 5; there is a very big 
difference in that MM energies can be calculated in almost no time at all, whilst HF-LCAO 
energies consume considerable resource. 


16.4 Electronic Energy 

As noted in Equation (16.4), the HF-LCAO electronic energy is given by 


e el = Tr (Ph,)+ -Tr(PG) 
= Tr (Ph F ) - ] -Tr (PG) 


The HF-LCAO matrix eigenvalue equation is h F c = eSc, and the lowest energy m solutions 
determine the electronic ground state of a closed shell molecule. The sum of orbital energies 
e orb is therefore 


£orb = 2 (£a + £b + • • • + Em) 
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Ifh F c A = £ A Sc A 
(c a ) t h F c A = £ A (c A ) T Sc A 
_ (c A ) T h F c A 
(c a ) t Sc a 
= (c a ) t h F c A 

and a little manipulation shows that 

£ei = £ 0 rb - '-Tr (PG) ( 16 . 20 ) 

Orbital energies do not sum to give the total electronic energy. I have written the formula 
in matrix terms for convenience when working in the LCAO approximation. The result is 
true for all HF wavefunctions, whether they are numerical atomic ones (and therefore at 
the HF limit) or just rough and ready ones that have been through the HF treatment but are 
nowhere near the HF limit. 


16.5 Koopmans’Theorem 

Koopmans’ theorem relates ionization energies to HF orbital energies, as discussed already 
for the atomic HF case. It is valid for any closed shell HF wavefunction, no matter how 
good or bad. It is subject to the following small print: the orbitals of the parent molecule and 
cation must remain unchanged on ionization. The fact is that the electron density invariably 
reorganizes on ionization, but Koopmans’ theorem calculations have been widely used to 
interpret photoelectron spectra. 


16.6 Open Shell Systems 

The vast majority of organic molecules have electronic configurations of the type 
(W 2 ■ ■ ■ Wm ) 2 

that can be described by the closed shell version of HF-LCAO procedure detailed above. 
A good deal of chemistry is concerned with excited triplet states, free radicals, cations and 
anions where there is at least one unpaired electron, and the procedure outlined above is 
not appropriate because the energy formula is incorrect. 

The HF-LCAO procedure is a chemically attractive one since it gives a clear orbital 
picture and it turns out that several classes of open shell molecules can still be treated using 
modified forms of the basic theory. The simplest case is shown in Figure 16.2. We have a 
number n, — M of doubly occupied orbitals and a number n 2 = P — (M + 1 )N of singly 
occupied orbitals, all with parallel spin electrons. 

We refer to this as the restricted open shell ( ROHF ) model. The closed shell energy 
formula of Equation (16.4) has to be modified, and for the sake of neatness I will introduce 
the occupation numbers v, (= 2) and v 2 (= 1) for the two shells. If I use i and j as indices 
for the doubly occupied orbitals and u and v for the singly occupied orbitals we find 
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Figure 16.2 Restricted open shell FIF 


Sd = vt J drj +^-v l J2J2( y ff '/ / f( r ^S(r,,r z )il/f(r 2 ) dz^ 

~ \ ff V r «( r i)Vo( r i)S'(r ! , r 2 )^(r 2 )Vo( r 2) d^dr^ 

+ v 2 ^ j^ r i)h m (r 1 )ir u (r l ) dr, + * i’ 2 (//V^( r i)K r i,r 2 )i/^(r 2 ) dT,dr 2 

- JJ V r «( r i)l/^( r i)iK r i> r 2)V f »( r 2)V r v( r 2) dtidr 2 ^ (16.21) 

+ VlV2 ^IZ (/ f ^> 2 ( r| )^( r| ’ r 2)^ 2 (r 2 ) dT,dr 2 

JJ V f /( r i)lW r i)tK r i. r 2 )^fj(r 2 )i/f„(r 2 ) dr,dr 2 ^j 


In the LCAO variation, we introduce n basis functions. We can collect the HF-LCAO 
coefficients of the «i doubly occupied orbitals into columns of a matrix U, (n x «,) and the 
coefficients of the « 2 singly occupied orbitals into columns of U 2 (n x n 2 ) and define the 
n x n density matrices 

Ri = 

R 2 = U 2 UJ 

Repeating the analysis given for closed shell states gives 

£ e i = ^Tr ^Ri ^ + u 2 Tr (r 2 (h, + ^G 2 ^ 


( 16 . 22 ) 
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where the two G matrices contain Coulomb and exchange contributions similar to those 
defined by Equation (16.12) for the closed shell case. We then allow Ri and R 2 to vary, 
subject to orthonormality, just as in the closed shell case. Once again, Roothaan (1960) 
showed how to write a Hamiltonian matrix whose eigenvectors give the columns Ui and 
U 2 above. 


16.7 Unrestricted Hartree-Fock (UHF) Model 

The methyl radical CH 3 can be easily produced by breaking the C-C bond in ethane: 
C 2 H 6 -> 2CH 3 

It has nine electrons and there are two ways we can progress, as shown in Figure 16.3. 



4 

4 







1 




ROHF 

UHF 

Figure 16.3 

Methyl radical 


First we can insist that the lowest energy molecular orbitals are doubly occupied, with the 
extra electron occupying the highest energy orbital (ROHF). On the other hand we can let 
the a and fi spin electrons occupy different spatial orbitals (the unrestricted Hartree-Fock 
(UHF) model). 

The treatment will be familiar to you by now. We write a single Slater determinant but 
allow the p a and the q fi spin electrons to have different spatial orbitals. Application of the 
Slater-Condon-Shortley rules gives an energy expression in terms of the HF orbitals as 
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£ a=J2 /( r i)^ (1) ( r i) K ( r i) dT i J2 J ( r i)^ (I) ( r i) f? ( r i) dr i 

+ 1212 jj ^‘ (ri) (ri) 8 (r '’ r2 ^“ ( r 2> *r ( r 2> dr ' dT2 

- \ 1212 JjW (r,) 't'i s (ri, r 2 ) f“ (r 2 ) tj (r 2 ) dr,dr 2 (16.23) 

+ J2Y1 fl^i (F|) 8 (r 2 , r 2 )Vf/ (r 2 ) ff (r 2 ) dr,dr 2 

- \ 12 J2 ff^‘ (ri) $ (Fl) 8 (r >>^ H (r ^ dr ' dr2 

There are no cross terms between the two sets of orbitals because of the orthogonality of the 
spin functions. We now introduce the LCAO concept; we expand each set of HF orbitals 
in terms of the same basis set Xi, Xz> ■ ■ ■ » Xn and form two density matrices, one for the 
a-spin electrons P“ and one for the /6-spin electrons P p in the obvious way. We finally 
arrive at two linked HF-LCAO Hamiltonian matrices; the a-spin matrix has elements 

^■ a = f Xi ( ri)^ (1) (rO Xj (l) dr 

+ EE {P a u + Pm) ffXi (r.)x, (ri) g (r,,r 2 ) X t (r 2 ) Xi (r 2 ) dr,dr 2 (|6 24) 

“EE P a M IIXi (F |)Xk (Fi) g (r,,r 2 ) Xi (r 2 ) Xi (r 2 ) dr,dr 2 

with a similar expression for the fi electrons. The UHF-LCAO orbitals are found by any 
of the standard techniques already discussed, for example, repeated construction of the 
density matrices, the two Hamiltonians, matrix diagonalization and so on until consistency 
is attained. The orbital energies can be associated with ionization energies using an extension 
of Koopmans’ theorem. 


16.7.1 Three Technical Points 

There are three technical points of which you should be aware. 

1. I chose the example CH 3 with some care, to make sure that the electronic ground state 
could be written as a single Slater determinant. There are some electronic states of very 
simple molecules where this is not possible, and more advanced considerations apply; 
for example, the first excited singlet state of dihydrogen cannot be represented as a single 
Slater determinant. 

2. The UHF method does not generally give wavefunctions that are eigenfunctions of the 
spin operator S 2 . The methyl radical UHF wavefunction will actually be a mixture of 
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various spin components ranging from the (correct) doublet state (spin quantum number 
s = 1/2) through in steps of 1 to the spin state with s = 9/2 


(16.25) 


tf'uHF = C 2 «UA + C4^=3/2 +••• 


and very often it happens that the ‘correct’ spin state dominates (c 2 ~ 1 in this case, a 
doublet) with the largest impurity coming from the next highest spin state (c 4 in this case, 
a quartet). There are two strategies: the easier option is to run the UHF calculation and 
eliminate the largest contaminant after the UHF wavefunction has been found; the more 
correct but very much more complicated procedure is to eliminate all spin contaminants 
from the wavefunction before performing the UHF calculation. The second option leads 
to a linear combination of Slater determinants, such as that given in Equation (16.25), 
where we have to find the HF-LCAO coefficients and also the expansion coefficients 
c 2 , c 4 and so on. Such extended Hartree-Fock (EHF) calculations are very expensive in 
computer resource, and I will return to this idea in Chapter 19. 

3. The ‘charges and bond orders matrix’ concept needs to be generalized to cover the case 
of open shells, because some orbitals are singly occupied (in the ROHF model) and 
because the spatial parts of the spinorbitals can be different (in the case of the UHF 
model). We collect the LCAO coefficients for the occupied a and fi spin electrons 
separately into two matrices that I will call U“ and U /f . Each matrix product U"(U") T 
and U /! (U /, ) T gives us the charges and bond orders matrices for the individual a- and 
/3-spin electrons. These matrices are sometimes referred to as P“ and P /< . The usual 
charge density P is the sum of these, and the difference is called the spin density Q. 


16.8 Basis Sets 

The stumbling block for molecular HF applications was the very large number of difficult 
integrals, especially those of the two-electron kind. A typical two-electron integral over 
basis functions 



might involve four different basis functions, all of which might be on different atomic 
centres, or not even on atomic centres. Generally there will be a lot of them, there is no 
obvious choice of coordinate origin, the integral has a singularity (that is to say, it becomes 
infinite as the two electrons approach each other) and it is six dimensional. Authors therefore 
spoke of the integral bottleneck, and the bottleneck was not broken until the early 1950s. 

16.8.1 Clementi and Raimondi 

Slater’s atomic orbital rules were the first step to a correct treatment of basis functions that 
could be used in molecules. Clementi and Raimondi (1963) refined Slater’s empirical ideas 
by performing atomic HF-LCAO calculations on atoms with atomic number 2 through 36, 
in order to calculate the variationally correct orbital exponents. Their synopsis is worth 
reading. 
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The self-consistent field functions for atoms with 2 to 36 electrons are computed with a 
minimum basis set of Slater-type orbitals. The orbital exponents of the atomic orbitals are 
optimized as to ensure the energy minimum. The analysis of the optimized orbital exponents 
allows us to obtain simple and accurate rules for the Is, 2s, 3s, 4s, 2p, 3p, 4p and 3d electronic 
screening constants. These rules are compared with those proposed by Slater and reveal the 
need for the screening due to the outside electrons. The analysis of the screening constants 
(and orbital exponents) is extended to the excited states of the ground state configuration and 
the positive ions. 

What they did, starting from Slater’s ideas of Is, 2s, 2p, . . . atomic orbitals with modified 
orbital exponents (effective nuclear charges), was as follows for each atom in their set. 

• Decide on the electronic ground-state configuration and choose starting values of the 
orbital exponents from Slater’s rules. 

• Optimize each orbital exponent individually by the HF-LCAO procedure. At the end of 
each optimization, the earlier optimized values will have changed and so . . . 

• Check for self-consistency amongst the orbital exponents and either exit or go back 
one step. 

There are better optimization procedures, as you will know from reading earlier chapters, 
but that is how the early workers did things. 

Some of the results of Clementi and Raimondi are given in Table 16.1. We call such basis 
sets single zeta or minimal because they use exactly the same number of atomic orbitals as in 
descriptive chemistry. For each atom there is just one Is orbital, one 2s, three 2p and so on. 


Table 16.1 Comparison of Slater's exponents with those of Clementi and Raimondi (CR) 


Atom 

CRIs 

Slater 

CR 2s 

CR 2p 

Slater value 


exponent 

value for Is 

exponent 

exponent 

for 2s/2p 

H 

He 

1.6875 

1.70 




Li 

2.6906 

2.70 

0.6396 


0.650 

Be 

3.6848 

3.70 

0.9560 


0.975 

B 

4.6795 

4.70 

1.2881 

1.2107 

1.300 

C 

5.6727 

5.70 

1.6083 

1.5679 

1.625 

N 

6.6651 

6.70 

1.9237 

1.9170 

1.950 

O 

7.6579 

7.70 

2.2458 

2.2266 

2.275 

F 

8.6501 

8.70 

2.5638 

2.5500 

2.600 

Ne 

9.6241 

9.70 

2.8792 

2.8792 

2.925 


16.8.2 Extension to Second-Row Atoms 

Clementi (1964) extended this treatment to the second row in his 1964 paper, and he wrote 
the following in his abstract. 

The self-consistent field functions for the ground state of the first and second row atoms (from 
He to Ar) are computed with a basis set in which two Slater-type orbitals (STO) are chosen 
for each atomic orbital. The reported STOs have carefully optimized orbital exponents. The 
total energy is not far from the accurate Hartree-Fock energy given by Clementi, Roothaan 
and Yoshimine for the first row atoms and unpublished data for the second-row atoms. The 
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obtained basis sets have sufficient flexibility to be a most useful starting set for molecular 
computations, as noted by Richardson. With the addition of 3d and 4f functions, the reported 
atomic basis sets provide a molecular basis set which duplicates quantitatively most of the 
chemical information derivable by the more extended basis sets needed to obtain accurate 
Hartree-Fock molecular functions. 

Clementi thus doubled the number of atomic orbitals and so used two slightly different 
Is orbitals, two sbghtly different 2s orbitals and so on. Once again, he optimized the 
orbital exponents by systematic variations and a small sample from his results is shown in 
Table 16.2. We refer to such a basis set as a double zeta basis set. Where the single zeta 
basis set for atomic lithium has a Is exponent of 2.6906, the double zeta basis set has two 
Is orbitals with exponents 2.4331 and 4.5177 (the inner and outer Is orbitals). 


Table 16.2 A selection from dementi's double zeta basis set 


Atom 

1 s exponents 

2s exponents 

2p exponents 

He 

1.4461 




2.8622 



Li 

2.4331 

0.6714 



4.5177 

1.9781 


Be 

3.3370 

0.6040 



5.5063 

1.0118 


B 

4.3048 

0.8814 

1.0037 


6.8469 

1.4070 

2.2086 

C 

5.2309 

1.1678 

1.2557 


7.9690 

1.8203 

2.7263 

N 

6.1186 

1.3933 

1.5059 


8.9384 

2.2216 

3.2674 

O 

7.0623 

1.6271 

1.6537 


10.1085 

2.6216 

3.6813 

F 

7.9179 

1.9467 

1.8454 


11.0110 

3.0960 

4.1710 

Ne 

8.9141 

2.1839 

2.0514 


12.3454 

3.4921 

4.6748 


The lowest energy HF-LCAO atomic orbital for a lithium atom will be a combination of 
the four s-type basis functions, and we call this combination ‘the’ atomic Is orbital. 

16.8.3 Polarization Functions 

There is a second point to notice from dementi’s paper, where he speaks about ‘the addition 
of 3d and 4f functions . . . ’ with reference to first-and second-row atoms respectively. Any 
chemist would write the fluorine electronic ground state configuration 

F : Is 2 2s 2 2p 5 

and so the d and f orbitals are unoccupied. When atoms combine to form molecules the 
atomic charge density may well distort from spherical symmetry and such polarization 
functions are needed to accurately describe this distortion. 
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16.9 Gaussian Orbitals 


None of these considerations actually broke the integrals bottleneck. There are two con¬ 
siderations. First is the sheer number of integrals that have to be processed. If we run a 
HF-LCAO calculation with n basis functions, we have to calculate p = \n(n + 1) one- 
electron integrals of each type (overlap, kinetic energy and electron-nuclear attraction). 
If n — 100 then we have to calculate 15 150 one-electron integrals. It is the two-electron 
integrals that cause the problem, for we have to calculate at most q = \p(p + 1) of them; if 
n = 100 then q = 1.25 x 10 7 . There are approximately \n A integrals (for large n), although 
this is to be seen as an upper limit. Many of the integrals turn out to be so tiny that they 
can be neglected for practical calculations on large molecules. 

Second is the mathematical intractability of the integrals, as mentioned above. The real 
breakthrough came in the 1950s with the introduction of ( Cartesian) Gaussian basis func¬ 
tions', these are similar to Slater orbitals but they have an exponential factor that goes as 
the square of the distance between the electron and the orbital centre: 


Xc = N G x l y m z" exp 


(-“0 


(16.26) 


Here l, m and n can have any integral values and the orbital exponent a is positive. If all of 
/, m and n are zero we talk about a ls-type Gaussian-type orbital (GTO). If one of l, m or 
n is equal to 1 we have a 2p-type GTO. Expressions for normalized Is and 2p r GTOs are 



When dealing with d-type (where / + m + n = 2) we note that there are six possibilities, 
xx, xy, xz, yy, yz, zz, rather than the five combinations we normally encounter for Slater-type 
orbitals (STOs). The combination (x 2 + y 2 + z 2 ) actually gives a 3s GTO, but we normally 
include all six Cartesian Gaussians in calculations. Similar considerations apply to the f, 
g, . . . orbitals. 

Why use Gaussians? Letme illustrate the answer by considering the two one-dimensional 
normal distribution curves (which are one-dimensional Gaussians) shown in Figure 16.4: 

G a (x) = exp (—a A (x - x A f) 

G b (x) = exp (-a B (x - x B f) 


I have taken a A = 0.1, x A = 1 (the full curve) and a B = 0.3 and x B = —2 (the dotted 
curve). It is easily shown that the product C A (x) x G B (x) is another Gaussian G c whose 
centre lies between G A and G B : 


G c (x) = exp ( -—— {x A - x B } \ 

V a A + a B J 

a A x A + a B x B 

x c = --- 

a A + a A 


exp (- (a A + a B ) (x - x c ) 2 ) 


(16.27) 


which is shown as the dashed curve in Figure 16.4. This is a general property of Gaussians. 
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Figure 16.4 Product of two one-dimensional Caussians 


Credit for the introduction of Gaussian basis functions is usually given to S.F. Boys for 
his 1950 paper, and here is the famous synopsis (Boys 1950). 

This communication deals with the general theory of obtaining numerical electronic wave- 
functions for the stationary states of atoms and molecules. It is shown that by taking Gaussian 
functions, and functions derived from these by differentiation with respect to the parameters, 
complete systems of functions can be constructed appropriate to any molecular problem, and 
that all the necessary integrals can be explicitly evaluated. These can be used in connection 
with the molecular orbital treatment, or localized bond method, or the general method of linear 
combinations of many Slater determinants by the variation procedure. This general method of 
obtaining a sequence of solutions converging to the accurate solution is examined. It is shown 
that the only obstacle to the evaluation of wavefunctions of any required degree of accuracy 
is the labour of computation. A modification of the general method applicable to atoms is 
discussed and considered to be extremely practicable 

GTOs have one great advantage over STOs; the nasty integrals (especially the two- 
electron integrals) we need for molecular quantum mechanics are relatively straightforward 
because they can be reduced from at most a four-centre integral to a one-centre integral 
by repeated application of the principle above. Consider for example an overlap integral 
between the two s-type GTOs shown in Figure 16.5. This is a three-dimensional extension 
to the one-dimensional problem discussed above. 

Gaussian A has exponent a A and is centred at r A ; Gaussian B has exponent a B and is 
centred at r B . If the position vector of the electron is r then its position vector relative to 
the centre of G A is r - r A , with a similar expression for G B . The overlap integral (apart from 
the normalizing constants N and N') is 
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Origin 

Figure 16.5 Overlap integral between two CTOs 


= N ft exp a A —— 

/ |r — r c | : 

= N J exp I —a c - 2 — 

The product GTO G c has exponent a c and centre r c given by 


( l r — rB l 

p -a B - 5 — d 

V «o / 


a c = a A + a B 
1 

r c = -;-(«A r A + « B r B ) 

«A + « B 

The remaining integral is a product of three standard integrals 



dy/exp (_„ c (£^f^ dx 


One major problem that concerned early workers is that GTOs do not give terribly good 
energies. If we try a variational calculation on a hydrogen atom with a single s-type GTO 




and calculate the optimal Gaussian exponent, we find a op , = 0.283 with a variational 
energy of —0A24E h . The experimental energy is —0.5£ h and the error is some 15%. A 
second problem is that the shapes of GTOs and STOs are quite different. Figure 16.6 shows 
the dependences oftheSTO (with exponent £ = 1) and the GTO (with exponent a = 0.283) 
on distance for a hydrogen atom. The plot is of wavefunction/do 3/2 versus distance from 
the nucleus, rla 0 . The full curve is the STO, the dashed curve the best GTO. 
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Distance/aO 

Figure 16.6 GTO versus STO for hydrogen atom 


GTOs show the wrong behaviour at the nucleus, where they should have a cusp because 
the mutual potential energy of the electron and the nucleus becomes infinite as the distance 
becomes zero. GTOs also fall off far too quickly with the distance from the nucleus. 

16.9.1 STO/nG 

The next step was to address the long-distance behaviour, and Hehre, Stewart and Pople 
proposed the idea of fitting a fixed linear combination of n GTOs to a given STO. The GTOs 
are not explicitly included in a HF-LCAO calculation, they are just used to give a good fit 
to an STO for integral evaluation. The resulting HF-LCAO orbitals can be thought of as 
minimal basis STOs. The GTOs are called primitive GTOs, and we say that the resulting 
atomic (STO) orbital is contracted. So, for example, we would use least squares fitting 
techniques to find the best three primitive GTO exponents a, and contraction coefficients 
d t in the STO/3G fit to a Is STO orbital with exponent 1: 

STO (f = 1) = ^GTO ( ai ) + d 2 GTO (a 2 ) + d 3 GTO (a 3 ) (16.29) 

The next keynote paper is therefore that of Hehre et al. (1969). As usual I will let the 
authors explain their ideas. 

Least Squares representations of Slater-type atomic orbitals as a sum of Gaussian-type orbitals 
are presented. These have the special feature that common Gaussian exponents are shared 
between Slater-type 2s and 2p functions. Use of these atomic orbitals in self-consistent 
molecular-orbital calculations is shown to lead to values of atomization energies, atomic pop¬ 
ulations, and electric dipole moments which converge rapidly (with increasing size of the 
Gaussian expansion) to the values appropriate for pure Slater-type orbitals. The f exponents 
(or scale factors) for the atomic orbitals which are optimized for a number of molecules are 
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also shown to be nearly independent of the number of Gaussian functions. A standard set of £ 
values for use in molecular calculations is suggested on the basis of this study and is shown 
to be adequate for the calculation of total and atomization energies, but less appropriate for 
studies of the charge distribution. 

As we increase the number of primitive GTOs in the expansion, the resultant looks more 
and more like an STO, except at the nucleus where it can never attain the correct shape (the 
cusp). I show the comparison in Figure 16.7 for the STO/3G basis set. 



Distance/aO 

Figure 16.7 The STO/3C expansion 


We therefore regard a minimal molecular basis set as comprised of STOs, except for 
integral evaluation where we use a linear expansion of n GTOs. 

Many molecular properties depend on the valence electrons rather than the shape of the 
wavefunction at a nucleus, two exceptions being properties such as electron spin resonance 
and nuclear magnetic resonance parameters. 

Tables of exponents and expansion coefficients are given in the original reference, and 
these all refer to an STO exponent of 1. These original GTO basis sets were ‘universal’ in 
that they applied to every atom irrespective of the atomic configuration; to convert from the 
STO exponent 1 to an exponent £ you simply multiply the primitive exponents by f 3/2 . For 
reasons of computational efficiency, all basis functions in a given valence shell are taken 
to have the same primitive GTOs (but with different contraction coefficients). 

16.9.2 STO/4-31G 

Such STO/nG calculations were common in the literature of the 1970s. It soon became 
apparent that they give poor results in a number of circumstances. There are particular 
problems for molecules containing atoms towards the end of the first period, such as oxygen 
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and fluorine, where they give poor bond lengths and just about every other property you 
can think of. Eventually it was realized that whilst most of the energy comes from the inner 
shell regions, some flexibility ought to be given to the valence regions. The valence orbitals 
are therefore split into (n — 1) primitives and one primitive so we represent a hydrogen 
atom as two basis functions as shown in Table 16.3. We think of an inner (3 GTO) and an 
outer (1 GTO) basis function. 


Table 16.3 STO/4-31C hydrogen atom basis functions 


Orbital exponent 

Contraction coefficient 

13.00773 

0.0334960 

1.962079 

0.22472720 

0.4445290 

0.8137573 

0.1219492 

1 


For other atoms, the inner shell basis functions are left in an STO/nG contraction. Again, 
you might like to read the synopsis of the keynote paper by Ditchfield et al. (1971). 

An extended basis set of atomic functions expressed as fixed linear combinations of Gaussian 
functions is presented for hydrogen and the first row atoms carbon to fluorine. In this set, 
described as 4-3 IQ each inner shell is represented by a single basis function taken as a sum 
over four Gaussians and each valence orbital is split into inner and outer parts described by three 
and one Gaussian function respectively. The expansion coefficients and Gaussian exponents are 
determined by minimizing the total calculated energy of the electronic ground state. This basis 
set is then used in single-determinant molecular-orbital studies of a group of small polyatomic 
molecules. Optimization of valence-shell scaling factors shows that considerable rescaling of 
atomic functions occurs in molecules, the largest effects being observed for hydrogen and 
carbon. However, the range of optimum scale factors for each atom is small enough to allow 
the selection of a standard molecular set. The use of this standard basis gives theoretical 
equilibrium geometries in reasonable agreement with experiment. 

16.9.3 Gaussian Polarization and Diffuse Functions 

I mentioned polarization functions briefly in Section 16.8. The best thing is for me to quote 
the synopsis of a keynote paper by Collins et al. (1976) at this point. 

Three basis sets (minimal s-p, extended s-p and minimal s-p with d functions on the second 
row atoms) are used to calculate geometries and binding energies of 24 molecules containing 
second row atoms, d functions are found to be essential in the description of both properties 
for hypervalent molecules and to be important in the calculations of two-heavy-atom bond 
lengths even for molecules of normal valence. 

The addition of a single set of polarization functions to a heavy atom STO/4-31G basis set 
gives the so-called STO/4-31G*, and further addition of (p-type) polarization functions to 
hydrogen gives STO/4-31G** in an obvious notation. There are more explicit conventions 
when using more than one set of polarization functions per atom. 

Polarization functions essentially allow spherical atomic charge distributions to dis¬ 
tort on molecule formation or in the presence of external electric fields. In order to 
treat species that carry formal negative charges or deal with molecular properties that 
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depend on regions of the valence shells that are far from the nucleus, it is necessary 
to include diffuse basis functions (primitives with very small exponents), and they are 
denoted + and ++. 


16.9.4 Extended Basis Sets 

Huzinaga’s set of uncontracted GTOs provides the classic example for our next topic. These 
large sets of primitive (uncontracted) GTOs comprise ten primitive s-type and six primitive 
p-type, for a first-row atom. The orbital exponents were carefully optimized for every first- 
row atom and for hydrogen. There is no reason in principle why we should not use them 
as they stand for molecular calculations but the contraction process is found to give great 
computational efficiency with little cost in energy. What is needed is a way of effectively 
grouping them together for a molecular calculation, i.e. a contraction scheme. 

Many authors performed HF-LCAO calculations on atoms and small molecules, and 
looked for groupings of the primitive GTOs that did not change their relative weight¬ 
ings from orbital to orbital and from molecule to molecule. I can illustrate these ideas by 
mentioning Dunning’s (1975) work, with the by-now inevitable synopsis. 

Contracted [5s3p] and [5s4p] Gaussian basis sets for the first-row atoms are derived from 
the (10s6p) primitive basis sets of Huzinaga. Contracted [2s] and [3s] sets for the hydrogen 
atom obtained from primitive sets ranging from (4s) to (6s) are also examined. Calculations on 
the water and nitrogen molecules indicate that such basis sets when augmented with suitable 
polarization functions should yield wavefunctions near the Hartree-Fock limit. 

Dunning’s notation and ideas can be explained with the example in Table 16.4, for the 
oxygen atom. 


Table 16.4 Dunning's [5s3p] contraction scheme for Huzinaga's 
(10s6p) CTO set 


GTO type 

Exponent 

Contraction 

coefficient 

s 

1805.0 

0.000757 


2660.0 

0.006066 


585.7 

0.032782 


160.9 

0.132609 


51.16 

0.396839 


17.90 

0.542572 

s 

17.90 

0.262490 


6.639 

0.769828 

s 

2.077 

1 

s 

0.7736 

1 

s 

0.2558 

1 

p 

49.83 

0.016358 


11.49 

0.106453 


3.609 

0.349302 


1.321 

0.657183 

p 

0.4821 

1 

p 

0.1651 

1 
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The first six primitive GTOs with exponents 1805.0 through 0.2558 contribute mostly to 
what we call ‘the’ atomic Is orbital. The two most diffuse s functions (those with exponents 
0.7736 and 0.2558) are the main components of what we call the 2s STO, and they are 
allowed to vary freely in molecular calculations. The Is primitive with exponent 2.077 
turns out to make substantial contributions to both the atomic Is and 2s orbitals, so that 
one is left free as a separate basis function. 

A typical package such as Gaussian98 will have very many basis sets as part of their 
database; you do not have to rediscover the wheel. On the other hand, some basis sets are 
good for one particular application and some are poor and there are implications of cost. 
The larger the basis set the higher the cost of the calculation, and the proportionality is far 
from linear. Choice of basis set is a specialist subject, just like many others in our subject; 
you simply have to take advice and look up recent literature citations. 
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HF-LCAO Examples 


I am going to start with L-phenylanine, as in Chapters 4-6, to exemplify the principles 
discussed so far for the HF-LCAO model. There are a number of commercial packages that 
perform (amongst other things) HF-LCAO calculations, and everyone has their favourites. 
All the packages will do the simplest types of application whilst certain algorithms are only 
found in a subset of the available packages. 

I am going to use mostly Gaussian 03 (http://www.gaussian.com) from this point on, but 
I stress that it is to a large extent a matter of personal choice and what is available on your 
local computer facilities. 

The simplest way for me to run a job on my local Unix system, given that I already have 
Protein Data Bank (.pdb) files from earlier chapters, is to edit a .pdb file to produce a ‘Gaus¬ 
sian job file’ (these have extensions .gjf) and run the job. In a typical Unix environment I 
might type (after the $ prompt) 

-bash-3.00$ module load gaussian 

-bash-3.00$ g03 <phenylanine.gjf >phenylanine.out 

A suitable .gjf file is given below. 

%chk=phenylanine.chk 
# HF/6-311G** Pop=Full SCF=Direct 
L-phenylanine 
0 1 

N 0.000 0.000 0.000 

H 0.000 0.000 1.010 

C 1.366 0.000 -0.483 


Molecular Modelling for Beginners, Second Edition Alan Hinchliffe 
© 2008 John Wiley & Sons, Ltd 



276 Molecular Modelling for Beginners 


etc. etc. 

C 6.224 1.487 -1.290 
H 7.262 1.551 -1.617 


Calculations at this level of theory are iterative and resource intensive. The first statement 
sets up a checkpoint file that is used to store information about the calculation in hand, 
together with the results of the calculation. Calculations can be restarted from checkpoint 
files. 

Next comes the ‘route’ through the package. I have chosen a closed-shell HF-LCAO 
calculation using the standard STO/6-311G** basis set. The major part of resource con¬ 
sumption for a HF-LCAO calculation is concerned with calculating and manipulating the 
two-electron integrals of which there a maximum of (about) n 4 /8, where n is the number 
of basis functions. A number of strategies have been developed for dealing with them in 
an efficient manner. In the early days it was usual to calculate all the integrals once, before 
the HF iterations began, and store them on magnetic tape or exchangeable disk storage 
(EDS). At each iteration, the integrals were read back into computer memory, the HF mat¬ 
rix constructed and diagonalized. Matrix diagonalization is ‘only’ an n 2 process and even 
the old-fashioned Jacobi or Householder algorithms are still found in modern modelling 
packages. 

It usually happens that many of the two-electron integrals are negligibly small, and 
it is possible to estimate whether an integral will fall into this category before actually 
calculating it. Only the significant ones then have to be calculated and stored. It is usual 
to store the two-electron basis function label i, j, k, l packed as bytes into a 32-bit word, 
together with the value of the integral (often multiplied by an appropriate factor of 10 and 
stored as an integer). 

The maximum number of basis functions determines the amount of storage needed for 
the label; using one byte for each index limits us to 2 8 — 1 = 255 basis functions. Raffenetti 
(1973) proposed an alternative way of storing the indices and the integral to make the 
generation of the HF-LCAO matrix easier. 

For molecules with any substantial amount of symmetry, many of the integrals turn out 
to be plus or minus each other and many can be shown to be identically zero by symmetry 
arguments, without even having to calculate them. Early packages such as POLYATOM 
made extensive use of this symmetry feature, although it must be said that most organic 
molecules have no symmetry to speak of. 

Such old-fashioned HF-LCAO calculations where the integrals are calculated once and 
then stored are usually input-output bound because of the finite time taken for disk or 
magnetic tape transfers compared to the time needed to set up the HF matrix; ideally one 
would like to calculate the integrals just once and hold them in RAM rather than store 
them externally, but early computers had very limited RAM. Technology has moved on, 
and many packages have a so-called incore option, where the integrals are calculated once 
at the start of the HF optimizations, and held in memory. In Gaussian 03 we would put 
SCF = Incore in the route (usage of the word ‘core’ is historical, and dates from the 
time when computer memory was made of magnetized ferrite rings and so people spoke 
about computer cores). Again, processor speed has also increased beyond belief, and it was 
eventually realized that a cost-effective alternative to calculation of the integrals once and 
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repeated read/write operations involving slow devices like disks and magnetic tape was 
to calculate afresh the integrals as and when needed on each cycle. That is the meaning 
of SCF = Direct in the route. The break-even point between Traditional and Direct 
calculations depends on many factors, but roughly speaking calculations involving more 
than 100 basis functions are best done with the Direct option. In fact this is the default 
option in Gaussian 03, I only wrote it out to draw attention to the issue. It is simply a 
historical trade-off between processor power and disk transfer rate. 

The Pop = Full is simply an option to print out all the LCAO coefficients and 
everything else one might possibly need at the end of the calculation. The next two records 
are the title then the charge on the molecule (0 in this case) and the spin multiplicity (it is a 
singlet spin state). Then come the Cartesian coordinates in angstroms, in the usual manner. 
I could have also have input the molecular geometry as a Z-matrix, as I would have been 
obliged to do all those years ago. 


17.1 Output 

I actually ran this job on my laptop under MS/Windows. The output is straightforward and 
I will explain it one piece at a time. 

Entering Link 1 = C:\G03W\ll.exe PID = 2368. 

Copyright (c) 1988,1990,1992,1993,1995,1998,2003, Gaussian, Inc. 

All Rights Reserved. 

This is the Gaussian(R) 03 program. It is based on the 
the Gaussian(R) 98 system (copyright 1998, Gaussian, Inc.), 
the Gaussian (R) 94 system (copyright 1995, Gaussian, Inc.), 
the Gaussian 92(TM) system (copyright 1992, Gaussian, Inc.), 
the Gaussian 90(TM) system (copyright 1990, Gaussian, Inc. ), 
the Gaussian 88(TM) system (copyright 1988, Gaussian, Inc. ), 
the Gaussian 86(TM) system (copyright 1986, Carnegie Mellon 
University), and the Gaussian 82 (TM) system (copyright 1983, 

Carnegie Mellon University) . Gaussian is a federally registered 
trademark of Gaussian, Inc. 

Cite this work as : 

Gaussian 03, Revision B. 01, 

M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, 

M. A. Robb, J. R. Cheeseman, J. A. Montgomery, Jr., T. Vreven, 

K. N. Kudin, J. C. Burant, J. M. Millam, S. S. Iyengar, J. Tomasi, 

V. Barone, B. Mennucci, M. Cossi, G. Scalmani, N. Rega, 

G. A. Petersson, H. Nakatsuji, M. Hada, M. Ehara, K. Toyota, 

R. Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, 

H. Nakai, M. Klene, X. Li, J. E. Knox, H. P. Hratchian, J. B. Cross, 

C. Adamo, J. Jaramillo, R. Gomperts, R. E. Stratmann, 0. Yazyev, 

A. J. Austin, R. Cammi, C. Pomelli, J. W. Ochterski, P. Y. Ayala, 

K. Morokuma, G. A. Voth, P. Salvador, J. J. Dannenberg, 

V. G. Zakrzewski, S. Dapprich, A. D. Daniels, M. C. Strain, 

0. Farkas, D. K. Malick, A. D. Rabuck, K. Raghavachari, 
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J. B. Foresman, J. V. Ortiz, Q. Cui, A. G. Baboul, S. Clifford, 

J. Cioslowski, B. B. Stefanov, G. Liu, A. Liashenko, P. Piskorz, 
I. Komaromi, R. L. Martin, D. J. Fox, T. Keith, M. A. Al-Laham, 

C. Y. Peng, A. Nanayakkara, M. Challacombe, P. M. W. Gill, 

B. Johnson, W. Chen, M. W. Wong, C. Gonzalez, and J. A. Pople, 
Gaussian, Inc., Pittsburgh PA, 2003. 


Gaussian 03: x86-Win32-G03RevB.01 3-Mar-2003 
25-Sep-2007 


The first part reminds us that packages such as Gaussian 03 have been developed over a 
number of years by a large team of people. There was a time in the 1970s when scientists 
freely exchanged their software, but large packages now tend to be commercial in nature 
and generally subject to the laws of copyright. The text box also serves the purpose of 
giving the appropriate citation for my calculations in this book. 

The next piece of relevant output summarizes the iterative calculations 

Rotational constants (GHZ): 2.0153582 0.6105112 0.5071802 
Standard basis: 6-311G(d,p) (5D, 7F) 

There are 282 symmetry adapted basis functions of A symmetry. 

Integral buffers will be 262144 words long. 

Raffenetti 1 integral format. 

Two-electron integral symmetry is turned on. 

282 basis functions, 472 primitive gaussians, 294 cartesian basis 
functions 

44 alpha electrons 44 beta electrons 
nuclear repulsion energy 695.8408871868 Hartrees . 

NAtoms = 23 NActive = 23 NUniq=23 SFac = 1.00D+00 NAtFMM= 60 Big = F 
One-electron integrals computed using PRISM. 

NBasis = 282 RedAO = T NBF=282 
NBsUse=282 1.00D-06 NBFU=282 

Harris functional with IExCor = 205 diagonalized for initial guess. 
ExpMin= 1.03D-01 ExpMax = 8.59D+03 ExpMxC= 1.30D+03 IAcc=l IRadAn=l 
AccDes= 1.00D-06 

HarFok:IExCor=205 AccDes=1.00D-06IRadAn=1 IDoV=l 
ScaDFX=l.000000 1.000000 1.000000 1.000000 
Initial guess orbital symmetries: 


The electronic state of the initial guess is 1-A. 

Warning! Cutoffs for single-point calculations used. 

Requested convergence on RMS density matrix=l. 00D-04 within 128 cycles. 
Requested convergence on MAX density matrix=l. 00D-02 . 

Requested convergence on energy=5.00D-05 . 

No special actions if energy rises. 

SCF Done: E(RHF)=-551.433305084 A.U. after 7 cycles 
Convg = 0.7930D-05 -V/T= 1.9998 

S**2 = 0.0000 
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You should by now know about most things in the text above apart from INDO. I will 
explain INDO in a later chapter; just accept for the minute that it is a cheap and cheerful way 
of setting off the HF-LCAO iterative calculations. The S**2 item is the expectation value 
of the spin operator and it shows that we are indeed dealing with a pure singlet spin state. 
This should not come as a surprise, since we required pairs of electrons to occupy the same 
spatial orbital. The virial ratio -VIT is the ratio of the expectation values of the potential 
and kinetic energies, and for an exact wavefunction it would equal 2. It is a quantity of 
some historical interest. 

A sensible choice of the initial electron density can make all the difference between 
success and failure; one strategy is to allow a few of the highest occupied and the lowest 
unoccupied orbitals to be partially occupied, in order to try and sort out the ground state 
from the excited states. Most packages make such choices as a matter of course. 

The final energy shown is the total, that is to say electronic plus nuclear repulsion; the 
quantity quoted is technically the reduced energy, e/E h . Next come the orbital energies 
(called not surprisingly EIGENVALUES since they are eigenvalues of the generalized 
HF-LCAO matrix eigenvalue equation) and LCAO coefficients; a small subset is shown in 
the next excerpt. 


Molecular Orbital Coefficients 


3 

(A)- -0 


4 

(A)- -0 


(A)- -0 


EIGENVALUES — -20.62766 -20.55967 -15.54226 -11.39650 -11.28230 


1 1 N IS 

2 2S 

3 2PX 

4 2PY 

5 2PZ 

6 3S 

7 3PX 

8 3PY 

9 3PZ 

10 4S 

11 4PX 

12 4PY 

13 4PZ 


0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

- 0.00002 

0.00000 

0.00001 

- 0.00001 

0.00015 

- 0.00002 

0.00008 

0.00000 


0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

-0.00003 

0.00001 

- 0.00001 

0.00000 

0.00032 

-0.00006 

0.00011 

0.00002 


0.55763 

0.46824 

0.00007 

0.00140 

0.00105 

0.00519 

-0.00004 

-0.00055 

-0.00040 

-0.00177 

0.00045 

- 0.00022 

- 0.00001 


0.00000 

0.00000 

0.00001 

- 0.00001 

0.00000 

-0.00003 

-0.00003 

0.00002 

- 0.00001 

0.00021 

-0.00008 

0.00003 

0.00009 


-0.00009 

-0.00014 

-0.00009 

0.00006 

0.00000 

0.00025 

-0.00042 

0.00024 

-0.00003 

0.00048 

- 0.00022 

0.00009 

0.00003 


The ‘(A)’ refers to the point group symmetry (there is none), the ‘—O’ means an occupied 
orbital. The first 12 MOs describe the heavy atom inner shells. Koopmans’ theorem applies 
to all HF wavefunctions, no matter whether they are at the Hartree-Fock (HF) limit or just 
minimal basis set ones. The orbital energies can therefore be used to predict and rationalize 
ionization energies. 


Mulliken atomic charges: 

1 N -0.467251 

2 H 0.193008 

3 C -0.102938 
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etc. etc. 

21 H 0.084857 

22 C -0.103404 

23 H 0.093449 

Sum of Mulliken charges=0.00000 


Above come Mulliken population analysis indices. As explained in earlier chapters, these 
give a rough-and-ready decomposition of the molecular charge density, but the Mulliken 
partitioning scheme is at best subjective and the numbers have to be treated with caution; 
the numbers should only be used when comparing similar molecules calculated with the 
same basis set. 

Finally we have a number of molecular properties, thought for the day and most important 
of all, timing for the calculation. 


Dipole moment ( field-independent basis, Debye): 

X = -0.5064 Y= 1.2770 Z = 0.0362 Tot=1.3743 
Quadrupole moment (field-independent basis, Debye-Ang) : 

XX=-75.3438 YY=-64.7431 ZZ=-75.3306 
XY = 0.6908 XZ = -4.1359 YZ = -3.8869 
Traceless Quadrupole moment ( field-independent basis, Debye-Ang): 
XX = -3.5380 YY = 7.0627 ZZ = -3.5248 
XY = 0.6908 XZ=-4.1359 YZ=-3.8869 
Octapole moment ( field-independent basis, Debye-Ang**2 ) : 

XXX = -1.5522 YYY= 32.4958 ZZZ = -1.2096 XYY = -0.4675 
XXY = -1.0455 XXZ = -6.3241 XZZ = 13.0673 YZZ = 2.1498 
YYZ=9.8991 XYZ=-6.7480 

Hexadecapole moment (field-independent basis, Debye-Ang**3 ) : 

XXXX = -2014.3972 YYYY = -555.8933 ZZZZ = -188.9884 XXXY = -20.3528 
XXXZ=-65.9431 YYYX=46.4166 YYYZ=-5.4711 ZZZX=-2.5108 
ZZZY = -0.3561 XXYY = -464.9388 XXZZ = -423.3291 YYZZ =-138.1946 
XXYZ = -52.7118 YYXZ = 19.1827 ZZXY = 2.9326 
N-N = 6.958408871868D+02 E-N = -2.683656283961D+03 
KE=5.515438104516D+02 

THE ONLY DIFFERENCE BETWEEN ECCENTRICS AND JOGGERS 
IS THAT JOGGERS WEAR SWEATBANDS WHEN THEY RUN IN THE RAIN. 

Job cpu time: 0 days 0 hours 6 minutes 2.0 seconds . 

File lengths (MBytes) : RWF = 51 lnt = 0 D2E = 0 Chk=12 Scr=l 


17.1.1 Gauss View, the Graphical User Interface for Gaussian 03 

The early versions of Gaussian did not have a GUI, and geometry input by the tedious 
Z-matrix was normal. A number of third-party GUIs appeared, which usually permitted 
constructions of the molecule on-screen. Their ‘output’ was a .gjf, although this was often 
hidden from the user. 
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GaussView is the GUI for Gaussian 03. The latest version is Gauss View 4, and the 
MS/Windows version is GaussViewW. It allows you to build molecules, submit jobs and 
visualize output. The screenshot shown in Figure 17.1 will give the flavour. 



Figure 17.1 A GaussView session 


GaussView sits on top of Gaussian 03; use it or not as you wish. It certainly adds to our 
quality of life. 


17.2 Visualization 

Visualization follows the path discussed in Chapter 15, but naturally there are very many 
more HF-LCAOs than in the case of dihydrogen. In any case, the HF-LCAOs are not 
uniquely determined by the HF procedure. This is because the electronic wavefunction has 
to be written as a Slater determinant; one of the properties of determinants is that we can add 
together multiples of rows and/or columns without changing their value. This is equivalent 
to mixing the HF-LCAOs, and at one time it was fashionable to use this idea in order to 
produce localized orbitals. Organic chemists place great emphasis on the highest occupied 
molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO). I have 
used GaussViewW to produce the HOMO and the LUMO orbital plots for L-phenylanine 
in Figures 17.2 and 17.3. 

The HOMO comprises large contributions from the benzene ring. By contrast, the LUMO 
has a nodal plane through the benzene ring as shown, and a much larger contribution from 
the remainder of the molecule. 

Plotting individual orbitals is all good fun, but there can be very many of them. 
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Figure 17.2 TheHOMOofL-phenylanine 



Figure 17.3 TheLUMOofl-phenylanine 


17.3 Properties 

Once the wavefunction has been determined, then various molecular properties can be 
calculated. My final section of standard output shows a selection of molecular electric 
moments. The so-called ‘quadrupole moment’ diagonal elements do not sum to zero which 
gives the game away; they are actually second moments. 

Such properties are often called primary properties because their electronic part can be 
obtained directly from the wavefunction 

(*)=/ ^(e*-) 


( 17 . 1 ) 
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where the operators refer to each of the n electrons. Usually it is necessary to add a corres¬ 
ponding nuclear contribution, since we work within the Bom-Oppenheimer approximation. 
For example, the electric dipole moment operator is 


Pe = «X! Z “ R “~ e X! r ' 


where the first sum runs over the N nuclei and the second sum over the n electrons. All 
electrons enter the sum on an equal footing, as they should, and the expectation value can 
be written in terms of the charge density P(r) as 



Such electric moments are often reported in non-SI units; the old-fashioned unit of 
length is the angstrom and the debye, itself a relic from the days of electrostatic units, is 
the dipole moment corresponding to a pair of equal and opposite charges of magnitude 
10 -10 electrostatic units (esu = g _1/2 cm 3/2 s -1 ) separated by 1 A (= 10 -10 m). There are 
2.9989 x 10 9 esu per coulomb, and so 1 D = 10 _1 ° esu xlO -10 m or 3.336 x 10 _30 Cm. 
The atomic unit of electric dipole is ea 0 = 8.4784 x 10 -30 Cm, which is 2.5418 D. 


17.3.1 Electrostatic Potential 

One of the fundamental objectives of chemistry is to predict and rationalize chemical react¬ 
ivity. In principle, this involves searching a potential energy surface for saddle points (i.e. 
transition states) and minima (reactants and products), and this kind of detailed investigation 
has only become possible in the last decade. Most of the traditional theories of chemical 
reactivity have concerned themselves with organic molecules, and the simplest theories 
have attempted to extract useful information from the electronic properties of reactants. 
We can distinguish static theories, which in essence make use of the electronic wavefunc- 
tion and/or electronic properties appropriate to an isolated molecule in the gas phase, and 
dynamic theories. Dynamic theories aim (for example) to predict the likely reaction sites 
for the approach of a charged reagent, usually modelled as a point charge. 

The electrostatic potential gives an index that has been widely used since the 1970s for 
just this purpose (e.g. Scrocco and Tomasi 1978). 

Figure 17.4 shows phenylanine; within the Born-Oppenheimer approximation, 
molecules are thought of as point positive charges (the nuclei) surrounded by continuous 
distributions of electron charge. A small portion of the molecular electron charge resides 
in the volume element dr. I can now calculate the electrostatic potential at points in space 
around the molecule using the methods of classical electromagnetism. I have placed a point 
charge Q at the origin; the electrostatic potential 0 at this point will contain contributions 
from the nuclei such as N: 


1 eZ N 
4 ^^ 
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Figure 17.4 Electrostatic potential due to phenylanine 


where Z N = 7, and a contribution from the electrons, which I can write in terms of the 
electron density —eP( r) as 




It is conventional to record the mutual potential energy of the molecule and a unit 
positive charge Q, rather than the electrostatic potential but authors in the field still speak 
of potential energy maps without making the distinction. Again, it is usual to present the 
calculations as contour maps, which would make beautiful illustrations for textbooks like 
this one, if only it were printed in colour. The electrostatic potential maps are used to give 
a rough-and-ready survey of a molecule, and the spirit of the calculation is that one does 
not need a particularly sophisticated basis set. Figure 17.5 shows phenylanine. 



Figure 17.5 Phenylanine electrostatic potential 
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Such objects are usually colour-coded. I have turned the molecule around a little from 
the previous plots. Interpretation is obviously a job for experts, and it is necessary to know 
the colour-coding. 


17.4 Geometry Optimization 

The next step might well be a geometry optimization. All the MM considerations of earlier 
chapters apply here, but there are two maj or differences. A typical MM energy minimization 
on a large biological molecule might involve many thousands of variables. Calculation of 
the energy at points on the molecular potential energy surface is trivial. A typical HF 
calculation will involve fewer variables but the energy calculation for each point on the 
molecular potential energy surface is now much more time consuming because a HF-LCAO 
calculation is involved. 

I explained in Chapter 16 how the HF energy could be written in terms of the electron 
density and various one- and two-electron integrals over the basis functions x■ The HF- 
LCAO matrix is 




and the energy gradient will therefore involve terms like 



where c is a parameter to be varied. Advanced methods of geometry optimization usually 
require both the gradient and the Hessian, and these can either be calculated numerically 
or analytically. It is generally accepted that the analytical methods are to be preferred in 
terms of resource consumption, and over the years a great deal of effort has gone into the 
elucidation of analytical expressions for the energy gradient and the Hessian. All the major 
molecular structure packages now use analytical gradients wherever these are available; 
in the case of the HF-LCAO model, the analytical gradients do not involve the LCAO 
coefficients directly, just the electron density matrix P. This speeds up the calculation by a 
large factor. 

By default, Gaussian 03 performs geometry optimizations in redundant internal coordin¬ 
ates via the Bemy algorithm. The choice of coordinate system for the starting structure 
is irrelevant and any efficiency factors concerned with (for example) molecular sym¬ 
metry are of no consequence since all structures are converted to redundant internal 
coordinates. 
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Let me return to L-phenylanine for my example. For illustration, I will run the job on a 
parallel machine. Because of the way my local Unix system is configured, I have specified 
8 processors each with a maximum of 2 Gb memory. This gives me maximum theoretical 
performance; your system will be different. 


%chk=phenopt.chk 

%nproc=8 

%mem=2Gb 

# HF/6-311G** Opt FreqT 
L-phenylanine geometry optimization 
0 1 

N 0.000 0.000 0.000 

H 0.000 0.000 1.010 

C 1.366 0.000 -0.483 


. (same as before) 


In the Gaussian 03 package, the default optimization method uses analytical expressions 
for the gradient and the Hessian. Here are some relevant parts of the output file. First of all, 
an estimate of the Hessian has to be made. 


GradGradGradGradGradGradGradGradGradGradGradGradGradGradGradGrad 
Berny optimization. 

Initialization pass. 


! Initial Parameters ! 
! (Angstroms and Degrees) ! 


! Name Definition Value Derivative Info. ! 


! R1 R(1,2) 
! R2 R(1,3) 
! R3 R(l,4) 
! R4 R(3,5) 
! R5 R(3,6) 
! R6 R(3,7) 
! R7 R(6,8) 


1.01 estimate D2E/DX2 ! 
1.4489 estimate D2E/DX2 ! 
1.01 estimate D2E/DX2 ! 
1.0902 estimate D2E/DX2 ! 
1.5219 estimate D2E/DX2 ! 
1.5259 estimate D2E/DX2 ! 
1.2301 estimate D2E/DX2 ! 


The first estimate is made numerically. Next the geometry iterations begin. 
The calculation proceeds (hopefully) downwards on the molecular potential energy 
surface: 
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Variable 

R1 

R2 

R3 

R4 

R5 

R6 

R7 


Old X 

1.90188 

2.73522 

1.90126 

2.05499 

2.89023 

2.90544 

2.29763 


-DE/DX 

(Linear) 

-0.00653 

-0.00228 

-0.00697 

-0.00382 

0.00948 

0.00539 

-0.05460 


Delta X 
(Quad) 
-0.00956 
-0.00391 
-0.01045 
-0.00747 
0.02027 
0.03101 
-0.03832 


Delta X 
(Total) 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 


Delta X 

-0.00956 

-0.00391 

-0.01045 

-0.00747 

0.01759 

0.02505 

-0.03832 


New X 

1.89232 

2.73131 

1.89081 

2.04752 

2.90782 

2.93049 

2.25931 


until eventually a stationary point is reached (that is, a point on the molecular potential 
energy surface where the gradient is zero). This could be a minimum or a saddle point: 


! Optimized Parameters ! 
! (Angstroms and Degrees) ! 


! Name Definition Value Derivative Info. ! 


! R1 R(1,2) 1.0012 
! R2 R(1,3) 1.4507 
! R3 R(1,4) 1.0003 
! R4 R(3,5) 1.0824 
! R5 R(3,6) 1.5232 
! R6 R(3,7) 1.5386 
! R7 R(6,8) 1.185 


and so on. 


17.5 Vibrational Analysis 

It is wise to calculate the force constants at this point on the surface, in order to charac¬ 
terize the stationary point. Just to remind you, harmonic force constants correspond to the 
eigenvalues of the Hessian, calculated at the stationary point. A minimum on the molecu¬ 
lar potential energy curve should have 3 N - 6 positive eigenvalues (3 N — 5 for a linear 
molecule), a transition state of the chemical kind will have just one negative eigenvalue of 
the Hessian and so on. In this example there are 23 nuclei and so 69 vibrational coordinates 
in total. This gives 63 vibrational coordinates but because I have chosen to do the calcu¬ 
lation in redundant internal coordinates rather than starting from a Z-matrix, there should 
be six redundant coordinates corresponding to the three translational coordinates of the 
centre of mass and three for the rotations. Each of these should have a force constant of 
zero, but there is a question as to how small a number needs to be before it is taken to be 
nonzero. This has to do with the optimization cut-off point, the convergence criterion for 
the HF-LCAO calculation and so on. 

Here is what I found. 


-DE/DX = 0.0 
-DE/DX = 0.0 
-DE/DX = 0.0 
-DE/DX =0.0 
-DE/DX =0.0 
-DE/DX =0.0 
-DE/DX =0.0 
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Full mass-weighted force constant matrix: 

Low frequencies--0.8888 -0.5279 -0.3445 0.0005 0.0006 0.0012 

Low frequencies- 34.4264 43.3413 60.3827 


The first six are essentially zero and are therefore taken to represent the redundant coordin¬ 
ates. The next piece of output gives the normal modes, as discussed in Chapters 3 and 4. 
Here is a snapshot. 


61 

...(Symmetry)... A 

Frequencies- 3722.9962 

Red. masses— 1.0501 

Frc consts— 8.5758 

IR Inten - - 2.7774 

Raman Activ- 110.9373 

Depolar (P) - 0.1335 

Depolar (U) - 0.2356 


62 

A 

3800.9366 

1.0942 

9.3135 

5.3387 

67.9404 

0.6701 

0.8025 


63 

A 

4108.2863 

1.0651 

10.5919 

106.2867 

72.1007 

0.2986 

0.4599 


Atom AN X Y 

1 7 -0.05 0.02 

2 1 0.23-0.29 

3 6 0.00 0.00 

4 1 0.48-0.05 

5 1 0.00 0.00 

6 6 0.00 0.00 

7 6 0.00 0.00 

8 8 0.00 0.00 

9 1 0.00 0.00 

10 1 0.00 0.00 

11 6 0.00 0.00 

12 8 0.00 0.00 

13 6 0.00 0.00 

14 6 0.00 0.00 

15 1 0.00 0.00 


Z X Y Z 

•0.01 0.02 0.02-0.08 
0.62 0.24 -0.30 0.58 

0.00 0.00 0.00 0.00 

•0.50 -0.51 0.08 0.50 

0.01 0.00 0.00 0.01 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 

0.00 0.00 0.00 0.00 


X Y Z 

0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
0.00 -0.06 0.00 
0.00 0.00 0.00 
0.00 0.00 0.00 
-0.07 1.00 -0.01 


Output includes the vibration frequencies, reduced masses, force constants, infrared 
intensities, Raman activities and depolarization ratios, together with the normal modes 
expressed as linear combinations of the Cartesian coordinates of the atoms. Normal mode 
63 for example, which is of A symmetry, comprises almost entirely the y component for 
atom 15, which is a hydrogen atom. 

Most packages will simulate the infrared spectrum (Figure 17.6) and allow for on-screen 
animation of the normal modes of vibration. I must stress that geometries have to be 
optimized before calculating vibrational frequencies, since these are defined as the second 
derivative calculated at the stationary point. 

I want to illustrate one final point by considering a much smaller molecule, carbon 
dioxide. It is a linear molecule and so has four vibrational degrees of freedom. It is also a 
very topical greenhouse gas. 

Table 17.1 shows the calculated frequencies. The agreement with experiment is not par¬ 
ticularly good. The intensities (not shown) are qualitatively correct; the symmetric stretch 
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Figure 17.6 Vibrational spectra 


Table 17.1 Calculated (FIF-LCAO) and experimental vibration wavenumbers (in cm 1 ) 


Mode 

Expt 

STO-3C 

STO/6-31G* 

STO/6-311G** 

Symmetric stretch (a g ) 

1388.17 

1435.2 

1518.7 

1522.8 

Asymmetric stretch (ct u ) 

2349.16 

2535.7 

2585.4 

2593.5 

Degenerate bend (tt) 

667.40 

566.2 

745.7 

766.8 


is not infrared active because the dipole moment does not change during the vibration, and 
the doubly degenerate bending vibration has a lower intensity than the asymmetric stretch. 

One decision that has to be made with a HF-LCAO calculation is the choice of basis 
set, and this is usually a matter of experience. I ran the calculation with three fairly 
simple basis sets, and the results are also shown in Table 17.1. Apart from the minimal 
basis set calculation, it is a common experience that HF-LCAO calculations overestimate 
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force constants by about 10%. Provided the same basis set is used from molecule to 
molecule, it is possible to scale the calculation by a fixed factor to gain very reliable 
vibrational data. 


17.5.1 Hellmann Feynman Theorem 

Before I finish this subsection, I want to tell you about a very early (but unsuccessful) 
attempt to take a shortcut in energy minimizations. 

Suppose that an electronic wavefunction 'I' depends on a single parameter c such as a 
single bond length or an orbital exponent. According to the variation principle, the best 
value of c is the one that minimizes the energy 

s (c) = j V* {c)HV (c) dr (17.3) 

subject to the requirement that the wavefunction is normalized 

J W (c)«F (c) dr = 1 (17.4) 

If we differentiate these two equations with respect to c we have 

de (c) fdW*(c)-~ f dH C ~ d# (c) 

—-^ = / — —HV(c) dr + / iP*(c) —<P(c) dr+ / ^dr 

d c J dc J dc J 3c 

(17.5) 

r dW* (c) r dW (c) 

/ - —<I'(c)dr+ / !F*(C)-—dr = 0 

J dc J dc 

Normally 1 > (c) will be an approximate wavefunction but suppose that (c) happens to 
be an eigenfunction of the Hamiltonian. A little analysis shows that 

de (c) f dH 

= / V {c)—V (c) dr (17.6) 

This result is known as the Hellmann Feynman theorem, and it was hoped in the early days 
that it could be used to optimize geometries. The first applications were to isoelectronic 
processes 

X -► Y 

such as the internal rotation in ethane, where it was believed that the only nonzero contri¬ 
butions to the right-hand side of Equation (17.6) would be due to changes in the nuclear 
positions and hence in the one-electron integrals. It was hoped that such calculations would 
be useful for geometry optimizations, where the changes also involve two-electron integ¬ 
rals but enthusiasm vanished once it became clear that approximate wavefunctions also 
depend on the parameter(s) contained either explicitly or implicitly in the wavefunction 
and so wavefunction gradient terms such as d'P/dc cannot be neglected. 
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17.6 Thermodynamic Properties 

In Chapter 8 I discussed the canonical ensemble and the canonical partition function 

e = X>p (~§f) < 17T > 

I added a * superscript to focus attention on the fact that E* refers to the collection of 
particles in each cell of the ensemble. There will be many possible allowed values for 
E*, and the N particles in each cell will contribute in some way to make up the total. So 
far, I have made no assumptions whatever about these particles, neither have I made any 
assumption about the way in which they might interact with each other. 

Suppose now that the N particles are identical molecules, but that they form essentially 
an ideal gas. One characteristic of an ideal gas is that there is no interaction between the 
particles, so the total energy E* of the N particles will be a simple sum of the particle 
energies. If I label the molecules 1, 2, 3,..., N then we can write 

E* = e (1) + e (2) H-1- e m (17.8) 

Each molecular energy will contain a kinetic and a potential part, but there are no inter- 
molecular interactions because of the ideal gas behaviour. So for each possible value of E* 
we have 

E* = + sj 2) + • • • + (17.9) 

Each allowed value of E* will correspond to different values of the constituent £’s and a 
simple rearrangement of Q shows that 

2 = (-^5=) X> (-£=)•• E“P (-$) 07.10) 

I have used a different dummy index i,j ,..., n but the sums are all the same since they refer 
to identical particles. At first sight we should therefore write 

2 = (l>P (-£?)) 

This assumes that the N particles can somehow be distinguished one from another (we say 
that such particles are distinguishable). The laws of quantum mechanics tell us that seem¬ 
ingly identical particles are truly identical, they are indistinguishableand to allow for this 
we have to divide Q by the number of ways in which N things can be permuted: 

2=^,(x> p (-^)) 07.11) 

I have dropped all reference to particle T’, since all N are identical. 

The summation 



<1 = J2 ex P 


(17.12) 
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refers only to the energy states of a representative particle, and it is a quantity of great 
interest in our theories. It is called the molecular partition function and given the symbol 
q (sometimes z). 


17.6.1 Ideal Monatomic Gas 

Consider now an even more restricted case, a monatomic ideal gas. Each atom has mass 
m and the gas is constrained to a cubical container whose sides are a, b and c (so that the 
volume V is abc). We will ignore the fact that each atom could well have different electronic 
energy, and concentrate on the translational energy. Elementary quantum mechanics texts 
(and Chapter 11) show that the available (translational) energy levels are characterized by 
three quantum numbers n x , n y and n z which can each take integral values ranging from 1 
to infinity. The translational energy is given by 



and so each particle will make a contribution 



(17.13) 


to the translational molecular partition function. Each of the three summations can be treated 
as follows. A simple calculation shows that, for an ordinary atom such as argon at room 
temperature constrained to such a macroscopic container, typical quantum numbers are of 
the order of 10 9 by which time the allowed energy states essentially form a continuum. 
Under these circumstances, we replace the summation by an integral and treat n x as a 
continuous variable. So for example 




h 


The translational partition functions are then given by 




(17.14) 
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The thermodynamic functions can be calculated using the equations given in Chapter 8. 
We have for example 



and so 


V = -Nk B T 
2 


For a monatomic ideal gas, the translational kinetic energy is the only energy that an 
atom will possess. The internal energy is therefore just what we would expect from the 
equipartition of energy principle. In a similar way we find that the entropy is 



(17.15) 


and this is called the Sackur-Tetrode equation. 

17.6.2 Ideal Diatomic Gas 

Monatomic gas particles can only store translational kinetic energy, whilst, for a polyatomic 
molecule, we have to consider: 

• rotational energy; 

• vibrational energy. 

We also need to consider electronic energies. In order to progress, we make the assumption 
that the different energy modes are independent, and that each set has its own Boltzmann 
distribution. We therefore write for a typical molecule 


(17.16) 


S = £(rans + £ rot + £ vib + £ etoc 


and it is easily seen that the molecular partition function is the product of a translational, 
rotational, vibrational and electronic molecular partition function. These are related to the 
canonical partition function by 


Q — T77 (7.ran s ^ro.^v,b7clcc)' V 


(17.17) 


17.6.3 q mt 

The rotational energy levels for a rigid diatomic rotator are given by 



= Bhc 0 J (J + 1) 


(17.18) 


where the rotational quantum number J takes integral values 0,1,2,..., I is the moment of 
inertia about an axis through the centre of mass of the molecule and c 0 is the speed of light 
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in vacuo. B is called the rotation constant. Each energy level is (2 1 + l)-fold degenerate, 
and so all 2J + 1 individual quantum states must be counted in the Boltzmann formula: 


^ / Bhc 0 J(J+ 1)\ 

q mt = I2 o ( - 2J+ !) ex P { --J 


The sum cannot be found in a closed form, but we note that for molecules with small 
rotational constants, the rotational energy levels crowd together and the summation can be 
replaced by an integral, treating / as a continuous variable: 


7 ( Bhc 0 J (J + 1) \ 

q mt = / (2/ + 1) exp I-—-Id/ 

o V B 7 (17.19) 

_ k B T 
hc 0 B 

Equation (17.19) works with complete reliability for all heteronuclear diatomic 
molecules, subject to the accuracy of the rotational energy level formula and the applicab¬ 
ility of the continuum approximation. For homonuclear diatomics we have to think more 
carefully about indistinguishability of the two nuclei which results in occupancy of either 
the odd / or the even / levels; this depends on what the nuclei are, and need not concern us 
in detail here. It is dealt with by introducing a symmetry factor <r that is 1 for a heteronuclear 
diatomic and 2 for a homonuclear diatomic. Parameter q mt is written 


<7rot = 


1 k B T 
a hc 0 B 


(17.20) 


The so-called rotational temperature 0 mt is often used in discussions of statistical 
mechanics; we rewrite Equation (17.20) as 


(17.21) 


17.6.4 q vib 

Vibrational energy levels have separations that are at least an order of magnitude greater 
than the rotational modes, which are in turn some 20 orders of magnitude greater than 
the translational modes. As a consequence, the spacing is comparable to k B T for everyday 
molecules and temperatures. If we consider a single harmonic vibrational mode for which 


e y = hc 0 a>e (v + l) 



and if we take the energy zero as that of the energy level v = 0 we have 


q ** 


X> p (- 


hc 0 V(o e \ 

k B T J 
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This simple geometric series can be summed exactly to give 
1 

9u,b - 1 f \ 

and once again it is usually written in terms of the vibrational temperature (9 vib : 


l-exp(-^) 


Finally we need to know the electronic partition function. This is almost always 1, unless 
the electronic ground state is degenerate (in which case it is equal to the degeneracy) or 
unless there is a very low-lying excited state. 


17.7 Back to L-Phenylanine 


Let me now return to L-phenylanine. The Gaussian 03 ‘frequency’ calculation also gives 
thermodynamic properties, calculated along the lines discussed above. Here is an abbrevi¬ 
ated output. First we have moments of inertia and then a calculation of the rotational and 
vibrational temperatures. 


Principal axes and moments of inertia in atomic units: 


1 


EIGENVALUES - - 




3017.84285 
.01842 


3333.43073 

0.00491 

-0.02889 

0.99957 


1026.19740 
0.99982 
-0.01827 
-0.00544 

m asymmetric top. 

Rotational symmetry number 1. 

Warning — assumption of classical behavior for rotation 
may cause significant error 

Rotational temperatures (Kelvin) 0.08440 0.02870 0.02598 

Rotational constants (GHZ): 1.75867 0.59802 0.54141 

Zero-point vibrational energy 532871.0 (Joules/Mol) 

127.35923 (Kcal/Mol) 


Vibrational temperatures: 49.53 62.36 86.88 142.01 

(Kelvin) 405.40 422.99 483.72 489.81 

653.24 713.44 813.05 927.86 


280.87 

545.09 

952.80 


Next come the various contributions to the internal energy U (here called E), the heat 
capacity C and the entropy S. 


Zero-point correction = 

Thermal correction to Energy= 
Thermal correction to Enthalpy = 


0.202960 (Hartree/Particle) 

0.213506 

0.214451 
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Thermal correction to Gibbs Free Energy = 0.164839 

Sum of electronic and zero-point Energies= -551.349677 

Sum of electronic and thermal Energies= -551.339131 

Sum of electronic and thermal Enthalpies = -551.338187 

Sum of electronic and thermal Free Energies= -551.387798 


E (Thermal) 
KCal/Mol 


Total 133.977 
Electronic 0.000 
Translational 0.889 
Rotational 0.889 
Vibrational 132.200 
Vibration 1 0.594 
Vibration 2 0.595 
Vibration 3 0.597 


CV S 

Cal/Mol-Kelvin Cal/Mol-Kelvin 
38.971 104.416 

0.000 0.000 

2.981 41.212 

2.981 30.713 

33.010 32.490 

1.983 5.556 

1.980 5.100 

1.973 4.445 


. . . etc 


The partition functions are also printed. 


17.8 Excited States 

Electronic ground states are all well and good, but there is a wealth of experimental spec¬ 
troscopic data that relates to the excited states of molecules. One of the simplest and most 
widely studied sequences of organic molecules to be investigated by spectrophotometry is 
afforded by the set of monocyclic azines, of which pyridine forms the first member. It is 
instructive to compare this series with benzene. 

All the molecules have 42 electrons, 21 electron pairs, and I can write their ground state 
electronic configurations 

% : if if iff ) 2 ■ • • ifnf 

We refer to P 0 as the reference state. Single excitations can be produced by promoting an 
electron from one of the occupied orbitals to an unoccupied (virtual) one, just as we did in 
Chapter 15 for dihydrogen, so the first few single excitations have configurations 

'Pi : i 

P 2 ■ f\f\ • • • fIlf21 
^3 : ... fjofiifL 

P A : iflfl ... f^f&f^ 

Electron spin has to be taken into account, and proper Slater determinants constructed. 
The ground-state wavefunction will be a single Slater determinant but there are four pos¬ 
sibilities (say, (a), (b), (c) and (d)) for each of the four excited states. In our dihydrogen 
calculation, I grouped the determinants together so that they were spin eigenfunc¬ 
tions. In modem configuration interaction (Cl) calculations, we work directly with the 
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individual Slater determinants and classify the resulting wavefunctions at the end of the 
calculation. 

We write the Cl wavefunction as 


*ci = c 0 ^o + Cla^la + c lb «^ lb + • • ■ 


where the CI expansion coefficients have to be determined from a variational calculation. 
This involves finding the eigenvalues and eigenvectors of a matrix whose elements are 
typically 


j VjVj dr 


The matrix elements can be determined from the Slater-Condon-Shortley rules, giving 
typically HF-LCAO orbital energies and various two-electron integrals such as 



These have to be calculated from the two-electron integrals over the HF-LCAO basis 
functions, at first sight a four-dimensional sum known in the trade as the four-index 
transformation. 

Two of each of the (a), (b), (c) and (d) Slater determinants correspond to spin eigenfunc¬ 
tions having spin quantum number M s = 0, one to M s = +1 and one to M s — —1 and so 
need not be considered because states of different spin symmetry do not mix. In addition, 
Brillouin’s theorem (Brillouin 1933) tells us that singly excited states constructed using 
HF wavefunctions do not mix with the ground state for a closed shell system, so we do not 
need to include the ground state B in the variational calculation. 

If we take all possible excited Slater determinants and solve the variational problem we 
reach a level of theory known as Cl singles (CIS). A keynote paper is that by Foresman 
et al. (1992), and their synopsis puts everything into perspective. 

This work reviews the methodological and computational considerations necessary for the 
determination of the ab initio energy, wavefunction and gradient of a molecule in an electron¬ 
ically excited state using molecular orbital theory. In particular, this paper re-examines a funda¬ 
mental level of theory which was employed several years ago for the interpretation of the elec¬ 
tronic spectra of simple organic molecules: configuration interaction (CI) among all singly sub¬ 
stituted determinants using a Hartree Fock reference state. This investigation presents several 
new enhancements to this general theory. First, it is shown how the CI singles wavefunction can 
be used to compute efficiently the analytic first derivative of the energy... Second, a computer 
program is described which allows these computations to be done in a ‘direct’ fashion. 

You should have picked up the words ‘direct’ and ‘gradient’. 

To return to my example of the azines; in the earliest and necessarily qualitative 
treatments, the first four transitions were identified with the single excitations shown in 
Figure 17.7. Benzene can be regarded as a special case of the series and the highest occu¬ 
pied orbitals are doubly degenerate, as are the lowest unoccupied ones. This leads to many 
degeneracies in the excited states that simply do not appear (other than by accident) in the 
substituted azines. The assignment of bands in the benzene spectrum led to considerable 
discussion in the primary literature. 
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Figure 17.7 Simple Cl treatment of azines 


Excitation energies and oscillator strengths: 
Excited State 1: Singlet-Bl 6.2005 eV 199.96 n 


19 ->25 
19 ->29 
19 ->36 
19 ->51 


0.60076 

0.16645 

-0.24330 

-0.17296 


Excited State 2: Singlet-B2 6.2623 eV 197.99 nm f=0.0882 
20 ->28 0.32051 

20 ->39 -0.11996 

21 ->25 0.57094 

21 ->29 0.14308 

21 ->36 -0.14253 

Excited State 3: Singlet-Al 6.5730 eV 188.63 nm f=0.0191 


20 ->25 
20 ->29 

20 ->36 

21 ->28 
21 ->39 


-0.39008 

-0.12134 

0.10609 

0.52009 

-0.15078 


Excited State 4: Singlet-A2 7.1312 eV 173.86 nm 
21 ->22 0.65005 

21 ->24 0.12283 

21 ->30 -0.14334 

21 ->44 -0.12506 

Excited State 5: Singlet-A2 7.3500 eV 168.69 n 
19 ->28 0.65180 

19 ->39 -0.24025 

Excited State 6: Singlet-Bl 7.6497 eV 162.08 m 

20 ->22 -0.29313 

21 ->23 0.61432 

21 ->45 0.11735 


f=0.0671 
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Excited State 
21 ->22 
21 ->24 
21 ->30 


7: Singlet-A2 8.1194 eV 152.70 
-0.14317 
0.64628 
-0.13451 


f=0.0000 


Excited State 
20 ->22 

20 ->30 

21 ->23 
21 ->26 


8: Singlet-Bl 8.1247 eV 152.60 
0.57081 
-0.11602 
0.28223 
-0.19020 


f=0.0011 


The text above shows sample output from a CIS singlet state calculation on pyridine, and 
I have only included the first eight excited states. To allow for the possibility that excited 
states are diffuse in nature, I added the *+’ diffuse functions to my 6-311** basis set. 

The MOs r/r 2 o and i/'ii are both tt type, whilst r/r 22 and ^r 23 are antibonding tt*. The 19th 
occupied orbital corresponds to the N lone pair. The lowest singlet excited state is n —>■ tt* 
as is the fourth. The remainder in my small sample are all tt —> tt*. The notation (for 
example) 20— > 220.57081 gives the contribution made to that particular excited state 
from the singly excited state 1^20 —>■ V / 22- 

I have collected a small number of calculated data in Table 17.2. In each case I have 
recorded the first n —» tt* and first two allowed tt —» tt* transitions, together with the 
oscillator strengths. 


Table 17.2 Spectroscopic data for various cyclic molecules: X (in nm) and oscillator strength 
f (d means doubly degenerate) 



Benzene 

Pyridine 

1,2-Diazine 

1,3-Diazine 

1,4-Diazine 


_ 

200.0 

249.8 

207.9 

238.4 



0.0072 

0.0126 

0.0131 

0.0110 

TT —>■ TT* 

167.9 

198.0 

186.5 

183.7 

197.4 


0.0965 

0.0882 

0.0425 

0.0747 

0.1901 

TT —> IT* 

156.7 

188.6 

145.3 

173.5 

180.2 


0.9169 

0.0191 

0.0073 

0.0389 

0.0777 


A satisfactory theory of the spectra of aromatic hydrocarbons was not developed until 
there had been a detailed classification of their absorption bands. This important work was 
first undertaken by Clar (1952). There are three types of absorption bands that are classified 
mainly by their intensity but also by their vibrational fine structure and the frequency shifts 
within a family of compounds. The ground state of benzene (with D 6h symmetry) has 
closed shell configuration aj u ei g which gives symmetry A lg . The first excited configuration 
a iu e i g e 2u g' ves rise to states of symmetry B lu , B 2u and E lu which can be either singlet or 
triplet. The experimental absorption spectrum shows a weak band near 260 nm, and a strong 
band at 185.0 nm. The CIS calculation gives a weak band at 167.9 nm and a strong band at 
156.7 nm and so our calculation accounts only qualitatively for the features of the benzene 
spectrum. 
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The lowest singlet excitation for the azines is n —»■ it* in character, and Mason’s (1962) 
experimental values are 270, 340, 298 and 328 nm. The CIS calculation gives the trend 
correctly but the absolute agreement with experiment is poor. Similar conclusions apply to 
the it —> tt* transitions. The general conclusion is that the CIS method gives a useful first 
approximation to the excited states. 


17.9 Consequences of the Brillouin Theorem 

I mentioned primary properties in Section 17.3; these are ones such as the electric dipole 
moment, which is given as the expectation value of a sum of one-electron operators (together 
with the corresponding nuclear contribution): 

(X) = j V* ^idr 

Suppose that we have a singlet ground-state closed shell HF wavefunction <f> 0 and we try to 
improve it by adding singly excited states; if I denote occupied orbitals A, B,... and virtual 
ones X, Y,... then 

= Co % + Y c!** (17-24) 

where c A x is the Cl expansion coefficient and <^ A X the appropriate combination of Slater 
determinants describing a singlet singly excited state. We find 

(. X ) =cl j X,%dr + 2J2 C x c 0 j ^Y X^ldr + Y c a c b J Y X^dr 

(17.25) 

and all terms on the right-hand side apart from the first are zero, since the c’s are zero by 
Brillouin’s theorem. In addition the only nonzero integrals in the third sum are those where 
the excited states differ by no more than one spin orbital. This can be easily proved by 
application of the Slater-Condon-Shortley rules. 

If we now repeat the argument but include doubly excited rather than singly excited states 

^=c 0 ^o+ Y c ab^a X b Y 

A.B.X.Y 

w = c o f ^ YWx + 2 Y O-o f %Yx^dz (17 26) 
+ Y E « / ^Yx#™ dr 

then the Cl expansion coefficients are no longer zero. However, the integrals in the second 
sum are all zero because the doubly excited states differ from & 0 by two spin orbitals. All 
the integrals in the third sum except those where are zero for the same reason, 

and the final expression is 


HF-LCAO Examples 301 


(X)=c 2 0 / %^X t %AT + Y / (17.27) 

d j=l A,B,X,Y ^ i=l 

At first sight, one-electron properties calculated from HF wavefunctions ought to be very 
accurate because the lowest ‘correction’ terms are certain doubly excited configurations. 

The argument is correct as far as it goes, but if we include both the singly and doubly 
excited states together 

^ = c 0 % + Y c a*^a X + Y (17.28) 

the singly excited states couple with the doubly excited ones, which couple with the ground 
state and so the singly excited states do make a contribution to the ground state. 

In any case, the key choice when performing HF-LCAO calculations is the choice of 
atomic orbital basis set, as can be seen from my representative HF-LCAO calculations on 
pyridine in Table 17.3. In each case I optimized the geometry. 


Table 17.3 Electric dipole moment for 
pyridine; EIF-LCAO model 


Level of theory 

p e /10 -30 C m 

HF/STO-3G 

6.870 

HF/6-31G 

8.811 

HF/6-31 G* 

7.720 

HF/6-311G** 

7.553 

HF/cc-pvQz 

10.846 

Experiment 

7.31 ±2% 


17.10 Electric Field Gradients 


I have made mention at several points in the text of electron spin, the internal angu¬ 
lar momentum possessed by electrons just because they are electrons. The total angular 
momentum of an atomic electron is found by summing the electron spin and the electron 
orbital angular momentum. 

Many nuclei have a corresponding internal angular momentum, which is given the sym¬ 
bol I and the vector is characterized by two quantum numbers I and m ,. The vector obeys the 
laws of quantum mechanics appropriate to angular momenta; for each value of I, m, — —I, 
—I+l,...+I, the size of the vector is 


and we can only measure simultaneously the size of the vector and one component that we 
conventionally take as the z component. The z component has size 


h 

'b — 

2 7T 


The nuclear spin quantum number is characteristic of a given nucleus and can have 
different values for different isotopic species. A sample is shown in Table 17.4. 
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Table 17.4 Representative nuclear 
spin quantum numbers 


Isotopic species / 

’H 

1/2 

2 D 

1 

12 C 

0 

,3 C 

'h 

,7 o 

5/2 


Many nuclei with / > 1 also possess a nuclear quadrupole Q n . It is usually defined in 
terms of the nuclear charge distribution p n (r) as 

j / / P„ (3x 2 - r 2 ) dr 3 / p B xydr 3 / p„x Z dr \ 

Qn = - 3 / p„yxdr / p n (3y 2 - r 2 ) dr 3 / p n yzdr (17.29) 

\ 3 / p n zxdr 3 / p n zydr / p n (3z 2 - r 2 ) dr / 

This definition gives a traceless tensor. Here the integration is over the nuclear charge 
distribution. Nuclear wavefunctions are pretty hard to come by, and we normally have to 
determine the components of Q n by experiment. 

In a molecule, a given nucleus will generally experience an electric field gradient due 
to the surrounding electrons. Electric fields are vector quantities and so an electric field 
gradient is a tensor quantity. The electric field gradient at nucleus n is usually written q n . 
The energy of interaction U between the nuclear quadrupole and the electric field gradient is 

U=~ e -Y J iQn)ij{<ln) ii (17-30) 

In principal axes, the interaction is determined by Q a = (Q„) zz and q n — the largest of the 
diagonal components of q n . The quantity eQ n q n /h is referred to as the quadrupole coupling 
constant. According to Townes and Dailey (1949), since filled shells and s orbitals have 
spherical symmetry, and since d and f orbitals do not penetrate near the nucleus, the quadru¬ 
pole coupling constant should be largely due to any p electrons present in the valence shell. 

Molecular quadrupole coupling constants are usually determined from the hyperfine 
structure of pure rotational spectra or from electric beam and magnetic beam reson¬ 
ance spectroscopies. Nuclear magnetic resonance, electron spin resonance and Mossbauer 
spectroscopies are also routes to this property. 

I can use the well-studied series HCN, FCN and C1CN to illustrate a couple of points. 
These molecules, including many isotopic species, have been exhaustively studied by spec¬ 
troscopic techniques. An interesting feature of the experimental geometries is that the CN 
bond lengths are almost identical in length yet the 14 N quadrupole coupling constant (QCC) 
is quite different (Table 17.5). 

Table 17.6 shows results calculated at the HF/6-311G** level of theory, including 
geometry optimization. 





HF-LCAO Examples 303 


Table 17.5 Experimental results for XCN 


Molecule R(C=N)/pm 14 N QCC/MHz 


HCN 

FCN 

CICN 


115.5 

115.9 

115.9 


-4.58 

-2.67 

-3.63 


Table 17.6 HF/6-311C** results for XCN 


Molecule R(C=N)/pm 14 N QCC/MHz 


HCN 

FCN 

CICN 


112.7 

112.5 


112.8 


-4.53 

-2.95 

-3.85 


The agreement with experiment is good to within a few percent. What chemical insight 
these calculations give into the reason why the three coupling constants are so different is 
not clear! 


17.11 Hyperfine Interactions 

We considered a simple two-state system in Chapter 7; I took for granted you were famil¬ 
iar with electron and nuclear spin, and had come across nuclear magnetic resonance and 
electron spin resonance. 

Atomic and molecular magnetic dipoles have to obey the quantum mechanical laws of 
angular momentum, and each dipole can make only a number of allowed orientations with 
an applied magnetic field. Each orientation corresponds to a different energy and absorption 
of a suitable photon may cause a change in orientation. This is the physical basis of the 
magnetic resonance experiment. 

The energy level differences are tiny in comparison to electronic ones, and it is conven¬ 
tional to discuss such hyperfine effects as perturbations on the electronic energy levels. 
In the electron spin resonance experiment, there are two terms of interest. First there is a 
classical electron spin dipole-nuclear spin dipole term 



(17.31) 


that averages to zero for tumbling species in the gas phase or in solution. 
Second there is the Fermi contact term 




: s. J-SWOR-) 


(17.32) 
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Here I is a nuclear angular momentum operator, S an electron angular momentum operator 
and R„, the vector joining nucleus a and electron i. The g’s and /3’s are the corresponding 
//-factors and magnetons. 

Experimentalists work with a spin Hamiltonian which in the latter case would be written 

(17.33) 

This spin Hamiltonian operates only on spin wavefunctions, and all electronic details are 
absorbed into the coupling constant a„. These can be written very neatly in terms of the 
spin density Q( r) as 

o a = ^gd3 egal 8 a -£jQ( RJ (17.34) 

where the spin density has to be evaluated at the position of nucleus a. The spin density is 
the difference between the a- and the /3-spin electrons, and unlike the total electron density 
it can have negative as well as positive regions. 

To give an example, the vinyl radical H 2 C=CH (Figure 17.8) can be produced by cleavage 
of a C-H bond in ethane, and it has been well studied in the gas phase. 



Figure 17.8 The vinyl radical 


AROHF-6-311G** calculation gives the HOMO shown in Figure 17.9. In the language 
of descriptive organic chemistry, this corresponds to the odd electron occupying a C sp 3 
hybrid, but of course there are negative regions as well as positive. The spin density is 
just the square of the HOMO and so is positive everywhere. The experimental evidence 
is that coupling constants can be positive or negative and so the ROHF model is not 
satisfactory. 

In the UHF model, the a- and /3-spin electrons occupy spatially different orbitals and so 
the spin density can indeed be negative in regions of space. AUHF/6-311G** calculation on 
the vinyl radical gives regions of space where the spin density is negative and in particular 
it is negative at the position of the second carbon atom. 

Some results of calculations and experiments are given in Table 17.7. I have put the 
experimental negative signs in brackets because the sign of the coupling constant cannot 
easily be deduced from studies in solution. Agreement with experiment is not particularly 
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Figure 17.9 ROHF calculation on the vinyl radical 


good in either case, and a further major problem with the UHF model is that it does not 
necessarily give a pure spin state 

h 2 

S 2 V v hf ± s (s + 1)-'/W 

An 2 

In general, if there are p electrons of a spin and q electrons of fi spin, the UHF wavefunction 
can contain spin components ranging from i = \(p — q) to ' (p + q). The expectation 
value 

(s 2 ) = J <fWS 2 tfWdrdi 

is taken as a measure of the spin impurities. In this calculation it came out as 0.9901 instead 
of the pure doublet value 3/4. 


Table 17.7 Flyperfine coupling constants (in mT) for vinyl radical 



C, 

H, 

C 2 

h 2 

h 3 

Experiment 

10.76 

(—)1.33 

(—)0.86 

6.81 

3.41 

ROHF 

11.49 

2.37 

1.19 

2.20 

1.32 

UHF 

17.75 

-1.33 

-4.22 

6.53 

4.56 


17.12 Problem Set 


A set of problems, together with solutions, is available on the associated website. 
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18 

Semiempirical Models 


I have spent a good deal of time discussing the current state of the art in quantum chemical 
calculations, and I have deliberately focused on techniques where we perform all of the 
calculations as exactly as possible. Over the years, it has become fashionable to refer to 
such models as ab initio. This Latin phrase translates into ‘from the beginning’; it certainly 
does not necessarily mean ‘correct’. Once we have stated the Hamiltonian and if necessary 
the atomic orbital basis set then all the integrals are evaluated exactly without any attempt 
at calibrating them against experiment. Thus the atomic Hartree-Fock model is an ab initio 
model, and the calculations I described for the hydrogen molecule ion and dihydrogen are 
ab initio calculations. 

These early calculations were done in the 1920s and the 1930s, and you probably noticed 
that I then jumped to the 1970s to discuss modern HF-LCAO calculations with GTO. 
Scientists first got their hands on computers in a meaningful way in the 1970s, and one 
unfortunate characteristic of ab initio calculations is that they are compute-intensive. From 
the early days, by necessity rather than choice, workers tried to develop simple HF-LCAO- 
based models that applied to certain classes of molecule, and a characteristic of these models 
is that they avoid all the difficult integrals. There is usually a price to pay, and the models 
have to be calibrated against experiment; some workers regard this as a major advantage in 
order to achieve meaningful chemical predictions from cheap calculations. We often refer 
to such calibrated models as semiempirical. This is not a Latin phrase but it translates into 
‘based on a correct model but calibrated and/or parameterized’. We can of course argue 
for ever about what constitutes a ‘correct’ model; Hartree-Fock theory is certainly not 
absolutely ‘correct’ because it averages over the electron interactions and so it usually runs 
into trouble when trying to describe bond breaking. 
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18.1 Hiickel 7r-Electron Theory 

Molecules such as benzene, pyridine and naphthalene that have extensive 7r-bonding sys¬ 
tems are not well described by classical valence theory. Such molecules are often planar, 
or largely planar, and two types of bond contribute to their structure: o and tt bonds. 
The er bonds are thought of as largely localized between atoms, whilst the jt bonds are to 
be thought of as delocalized over large portions of the molecule. Much of the interesting 
chemistry of such compounds appears to relate to the so-called it -electrons, and we visu¬ 
alize these 7i -electrons as moving under the influence of the nuclear framework and the 
ct- electrons. The o -electrons are thought to form an inert core. 

We meet such compounds every day; for example, /6-carotene is found in carrots and is 
known to be a biological precursor of vitamin A. In 1933, the compound benzo[a]pyrene 
was isolated from coal tar; this compound and 7,12-dimethylbenz[a]anthracene are two of 
the most potent carcinogens known. Such materials are found in soot and cigarette smoke. 
On the other hand, methyl orange is a prototype for of a class of commercially important 
compounds known as azo dyes. The structures of /6-carotene, benzo[a]pyrene and methyl 
orange are shown in Figure 18.1. 



/3-carotene 





benzo[a]pyrene 


methyl orange 


Figure 18.1 Some conjugated systems 


Probably for all these reasons, people have striven from the early days of quantum 
chemistry to give simple models of such compounds in an effort to: 

• explain their special structure, stability and properties; 

• predict new properties of related but undiscovered molecules (without the labour of 
having to actually synthesize them). 

One of the earliest models for 7r-electron molecules is the Hiickel it -electron model, 
which dates from 1931 (Hiickel 1931). The ideas are simple and chemically appealing, and 
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the model enjoyed many years of successful application to individual molecules, clusters 
and solids. 

Imagine then a simple molecule such as ethene; there are 16 electrons in total (14rr + 2nr) 
and as usual we concentrate on the electronic wavefunction. The total electronic wavefunc- 
tion will be some complicated function of the space and spin variables of all 16 electrons, 
but a basic tenet of Htickel theory is that we can absorb the 14<r-electrons into the ‘core’ 
and just concentrate on the 2 u -electrons. 

This is a perfectly sound procedure, akin to the idea of effective core potentials to be 
discussed later. 

In its simplest form, Htickel theory uses the HF-LCAO idea and seeks to write the n 
molecular orbitals as linear combinations of a suitable set of atomic basis functions; these 
are not actually defined but we can think of them as the carbon 2p T atomic orbitals. Because 
the atomic 2p T orbitals have a nodal plane in the molecular plane, we do not need to consider 
any other atomic orbitals (such as the C 2s and the H Is). So, it is an extremely minimal 
basis set calculation. 

If I label the basis functions Xi and Xi, then we write the HF-LCAO’s i/r as 
f = C \X\ + C2X2 

and find the c’s from the standard HF-LCAO matrix eigenvalue problem 
h F c = eSc. 

At this point, Htickel theory introduces a number of simplifying assumptions. 

• The basis functions are assumed not to overlap each other, and we also take them to be 
normalized. This means that the overlap matrix S is the unit matrix and so the HF-LCAO 
matrix eigenvalue problem simplifies to h F c = ec. 

• The elements of the HF-LCAO Hamiltonian are taken to be constants that depend only 
on the atom orbital types involved. The ‘atom’ parameter a depends only on the type 
of atom and not the chemical environment; the ‘bond’ parameters depend only on the 
constituent atom types. Matrix elements between orbitals on nonbonded atoms are set 
to zero. This means there are no HF-LCAO-type iterations, all that is needed is a single 
matrix diagonalization. In Htickel’s day this was a major problem. 

• Atom and bond parameters are only given numerical values as a last resort. 

Thus, for example, in the case of ethene we have to solve 



No HF-SCF iterations are needed because the terms in the square matrix are constant; they 
do not depend on the electron density (through the c’s). At first sight, it looks like we might 
need numerical values for a c and fi cc but the trick is to divide through by /3 CC and put 
x = (a c — e)/pec giving a matrix eigenvalue problem 



The script below shows how to tackle the problem using Mathcad 8. 
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Sample Huckel MO calculation using Mathcad 8.0 
ORIGIN := 1 H is the Huckel hamiltonian 

H^i := 0 H 22 := 0 the vector x contains the eigenvalues 

H-i 2 := 1 H 2i1 := 1 vectors vl, v2 contain the corresponding 

eigenvectors 



x := eigenvals (H) 

vl := eigenvec (H, x,) 

v2 := eigenvec (H, x 2 ) 


It turns out that ft is a negative quantity, so in the electronic ground state the first n MO 
corresponds to eigenvector v 2 .1 have therefore relabelled the results in Table 18.1. 


Table 18.1 Huckel calculation on ethane 


MO and energy 

Cl 

c 2 

Voa + fi 

0.7071 

0.7071 

fiOl-P 

0.7071 

-0.7071 


MO t/fi is doubly occupied. In organic-speak, this is the HOMO whilst i Jr 2 is the LUMO. 
The HOMO is i/q = 0.7071 (xi + Xi) and so the electron density is twice the square of 
this times —e. A simple calculation shows that the Mulliken population indices are, when 
collected as the matrix of charges and bond orders 


Each C atom formally contributes a single electron to the n system leaving a C + and so the 
overall charge on each C atom is 0. 

18.1.1 Treatment of Heteroatoms 

How do we deal with a heteroatomic system such as pyridine? We need new parameters 
for each heteroatom and a new parameter for each new bond. For historical reasons these 
are written in terms of the carbon parameters as 

otx = oe c + hxficc 

Pxy = kxrP cc 
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where the h x and the k XY have to be determined by appeal to experiment. For example, 
the nitrogen atom in pyridine ought to carry a larger negative charge because nitrogen is 
more electronegative than carbon and a direct calculation shows that h N should be positive. 
Also, the values are different for the N atom in pyridine (where the N atom formally 
donates one electron to the jt system) and pyrrole (where the N atom donates two electrons) 
(Figure 18.2). Typical values are shown in Table 18.2. Notice that CH 3 is treated as a n group. 



pyridine pyrrole 

Figure 18.2 Pyridine and pyrrole 


Table 18.2 Typical h and k parameters 


Thus for pyridine 
/ 


Atom X 

hx 

kcx 

C 

0 

■\ 

N (one ;r-electron) 

0.5 

0.8 

N (two ^-electrons) 

1.5 

1.0 

B 

-1.0 

0.7 

O (two ^-electrons) 

1.0 

0.8 

ch 3 

2.0 

0.7 


with the N atom labelled 1 

, the secular equations 

0.5 

0.8 

0 

0 

0 

0.8 

\ 

/c,\ 


( Cl 

0.8 

0 

1 

0 

0 

0 


C 2 


Cl 

0 

1 

0 

1 

0 

0 


c 3 


C 3 

0 

0 

1 

0 

1 

0 


c 4 

X 

c 4 

0 

0 

0 

1 

0 

1 


c s 


Cs 

0.8 

0 

0 

0 

1 

0 

/ 

V C 6 / 


\ C 6 


18.2 Extended Hiickel Theory 

Huckel’s calculations on planar conjugated systems were welcomed by the scientific com¬ 
munity of the 1930s and were heavily exploited for many years. You might like to glance 
at the classic text Molecular Orbital Theory for Organic Chemists (Streitwieser 1962) to 
get the flavour of those early days. 

Despite these successes, the natives were restless. Theoreticians were unhappy because of 
the arbitrary division of electrons into ‘core’ and ‘valence’ or ‘jr -electron’ with no apparent 
justification other than it worked; how did the 7r-electrons know the core electrons were 
there, if they were not formally included in the calculation? Organic chemists were unhappy 
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because most of their interesting molecules did not contain benzene rings and inorganic 
chemists were particularly aggrieved because there was no mention of inorganic molecules 
in Huckel’s theory. 

Huckel’s calculations referred to the so-called ;r-electron system. What few calculations 
had been done on the core had used a method described as the ‘linear combination of bond 
orbitals’, where Hiickel-type schemes were used to try and combine together (for example) 
sp 3 hybrid orbitals. 


18.2.1 Wolfsberg and Helmholtz 

The pioneering work of Max Wolfsberg and Lindsay Helmholtz on the electronic structure 
of the ions CrOj, MnOj and Cr0 4 _ is usually cited as the first serious attempt to apply 
the Hiickel model to inorganic complexes. The synopsis is given below (Wolfsberg and 
Helmholtz 1952). 

We have made use of a semi empirical treatment to calculate the energies of the molecular 
orbitals for the ground state and the first few excited states of permanganate, chromate and 
perchlorate ions. The calculations of the excitation energies is in agreement with the qual¬ 
itative features of the observed spectra, i.e. absorption in the far ultraviolet for C10 4 with 
two strong maxima in the visible or near ultraviolet for Mn0 4 and Cr0 4 _ with the chro¬ 
mate spectrum displaced towards higher energies. An approximate calculation of the relative 
/-values for the first two transitions in Cr0 4 2 and MnC> 4 ~ is also in agreement with experiment 


The authors point out that the tetrahedral ions P0 4 2- , SO4 2- and C10 4 _ show no visible 
or near-ultraviolet absorption whilst the ions X0 4 “ of the fourth row transition elements, 
which have the same number of valence electrons, show characteristic visible and near¬ 
ultraviolet absorption, namely two strong maxima with the corresponding peaks displaced 
towards shorter wavelengths (higher energies) as the atomic number of the central atom 
decreases. The oscillator strength (f ) relates to the area under an absorption peak and is 
also a theoretical measure of the intensity of a given transition. 

The extended Hiickel model treats all valence electrons, within the spirit of the n -electron 
model. Each molecular orbital is written as an LCAO expansion of these valence orbitals, 
which can be thought of as being Slater-type orbitals; these orbitals are very similar to 
hydrogenic ones except that they do not have radial nodes. Once again we can understand 
the model best by considering the HF-LCAO equations. 

For H and He, the atomic basis set consists of a single Is orbital. For Li through Ne, the 
inner shell electrons are treated as part of the core and the basis functions used are atomic 
2s, 2p x , 2p y and 2p z . For Na through Al, the inner shell is treated as part of the core and we 
consider only 3s, 3p x , 3p v and 3p z orbitals. For Si through Cl we have to decide on whether 
or not to include the atomic 3d orbitals in addition, and practice varies. Most authors 
included them, especially for hypervalent molecules such as C10 4 “. The diagonal elements 
of the HF-LCAO matrix are taken to be the negatives of the valence shell ionization energy 
(VSIE) for the orbital in question. You have probably met VSIE elsewhere; the orbital 
configuration for carbon ground state is written 

C : (Is) 2 (2s) 2 (2p) 2 
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which is arguably unhelpful when we want to describe the bonding in methane. We imagine 
a two-step process where one electron is promoted from the 2s to a 2p orbital giving 


C : (Is) 2 (2s) 1 (2p) 3 

The four valence orbitals are then mixed to form sp 3 hybrids, giving four equivalent orbitals 


C : (Is) 2 (sp 3 ) 4 


which is said to form the valence state. Sadly, it does not correspond to a known spectro¬ 
scopic state of the carbon atom and so you will anticipate that many learned discussions 
took place to try and assign an energy value to the diagonal matrix elements of extended 
Hiickel theory. Things are far, far worse for transition metal ions with variable valencies. 
Wolfsberg and Helmholtz went to great pains to try and get reasonable values for the diag¬ 
onal terms by comparison with electronic spectra. Modern thinking is that the parameters 
are simply parameters that should be chosen to give a good fit of some measurable physical 
property against experiment, and that they should take the same value for a given element 
irrespective of its chemical environment. 

Wolfsberg and Helmholtz did their calculations in terms of ‘group’ orbitals. Such orbitals 
are simply linear combinations of atomic orbitals that mirror the symmetry of the molecule. 
In those days, it was essential to reduce the size of any matrix eigenvalue problem to one 
that could be done by hand. We have moved on and do not generally bother with group 
orbitals any more, although an understanding of molecular symmetry is vital in order to 
understand the spectroscopic transitions. 

Once the diagonal elements have been assigned, the off-diagonal elements are calculated 
by the key formula 



(18.1) 


where S$ is the overlap integral between the two basis functions x, and x r Parameter k is 
a ‘constant’ that can be adjusted to give agreement with experiment; common experience 
suggests that 1.75 is a reasonable choice. 

As a variation on the theme, some early authors chose to write 



(18.2) 


To use the chlorate ion example cited by Wolfsberg and Helmholtz, we would these days 
use a basis set comprising 2s and 2p STOs on oxygen with 3s, 3p and 3d orbitals on chlorine 
giving a total of 25 basis functions. I have added d orbitals on Cl because it is hypervalent 
in the molecule under study and people argue that d orbitals are necessary in such cir¬ 
cumstances. That gives 16 doubly occupied valence shell molecular orbitals (made up as 
follows: 6 valence electrons from each oxygen, 7 from chlorine and 1 from the excess neg¬ 
ative charge). A standard modem extended Hiickel calculation with a tetrahedral geometry 
and bond length of 147 pm gave me the orbital energies listed in Table 18.3. You might get 
slightly different results depending on your choice of Hiickel parameters; I used the standard 
default values in Gaussian 03. Notice how close together the highest occupied orbitals are; 
the order obtained is a little different from that reported by Wolfsberg and Helmholtz, but 
that is not surprising. Unlike advanced theories where electron repulsion is taken explicitly 
into account, Hiickel excitation energies are given as differences in orbital energies. 
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Table 18.3 Extended Huckel treatment of 
CI0 4 “ (tetrahedral, R C i-o = 148 pm) 


Orbital 

Energy/F„ 

4t 2 

0.5234 

3a, 

0.4938 

3t 2 

-0.5340 

It, 

-0.5361 

1e 

-0.5487 

2t 2 

-0.5890 

2a, 

-0.7312 

1t 2 

-1.2324 

la, 

-1.3718 


Some transitions are allowed (and possibly strong) and some are forbidden (but often are 
present as weak bands). Without going into the details and the group theory, my prediction is 
that absorption bands for C10 4 “ should be seen at about 1 hartree(l hartree = 27.2114 eV), 
which is much too large compared to the experimental value of 6 eV. Perhaps you can see 
now why Wolfsberg and Helmholtz had to fiddle around so much in order to get decent 
agreement with experiment. C10 4 “ is of course colourless, so at least the prediction is on 
the right side of credibility. 


18.2.2 Roald Hoffmann 

Credit for extending the Huckel ideas to organic molecules belongs to Roald Hoffmann 
(Nobel prizewinner in 1981). The synopsis of his 1963 paper says it all (Hoffmann 1963). 

The Huckel theory, with an extended basis set consisting of 2s and 2p carbon and Is hydrogen 
orbitals, with inclusion of overlap and all interactions, yields a good qualitative solution of 
most hydrocarbon conformational problems. Calculations have been performed within the 
same parameterization for nearly all simple saturated and unsaturated compounds, testing a 
variety of geometries for each. Barriers to internal rotation, ring conformations, and geometrical 
isomerism are among the topics treated. Consistent a and n charge distributions and overlap 
populations are obtained for aromatics and their relative roles discussed. For alkanes and 
alkenes charge distributions are also presented. Failures include overemphasis on steric factors, 
which leads to some incorrect isomerization energies; also the failure to predict strain energies. 

It is stressed that the geometry of a molecule appears to be its most predictable property. 

Valence shell orbitals are used, and the same key formula (Equation (18.1)) is used to 
find the Hamiltonian. 


18.3 Pariser, Parr and Pople 

The next advances came in the 1950s, with a more systematic treatment of electron repulsion 
in n -electron molecules. These two keynote papers due to Rudolf Pariser and Robert Parr 
and to John Pople are so important that I will give the abstracts (almost) in full. Pople 
(1953) wrote: 
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An approximate form of the molecular orbital theory of unsaturated hydrocarbon molecules 
in their ground states is developed. The molecular orbital equations rigorously derived from 
the correct many-electron Hamiltonian are simplified by a series of systematic approximations 
and reduce to equations comparable with those used in the semi-empirical method based on 
an incompletely defined one-electron Hamiltonian. The two sets of equations differ, however, 
in that those of this paper include certain important terms representing electronic interactions. 
The theory is used to discuss the resonance energies, ionization potentials, charge densities, 
bond orders and bond lengths of some simple hydrocarbons. The electron interaction terms 
introduced in the theory are shown to play an important part in determining the ionization 
potentials... 

You should have picked up many of the key phrases. He started from the HF-LCAO equa¬ 
tions and made what is now known as the o — n separation approximation; the n'-electrons 
are treated separately and the effect of the remaining o -electrons is absorbed into the 
HF-LCAO Hamiltonian. The HF-LCAO equations have to be solved iteratively in order 
to get the HF-LCAO n -electron molecular orbitals, and many of the two-electron integ¬ 
rals (the ‘electronic interaction terms’) are retained. In order to take account of the effect 
of o — 7t separation, most integrals are calibrated by appeal to experiment. The ‘charges 
and bond orders’ are simply the Mulliken populations calculated with an overlap matrix 
equal to the unit matrix, and ionization energies are calculated according to Koopmans’ 
theorem. 

The second keynote paper by Pariser and Parr (1953) also gives a snapshot of the times, 
when there was a great deal of interest in the electronic spectra of conjugated molecules. 
They wrote: 

A semi-empirical theory is outlined which is designed for the correlation and prediction of the 
wavelengths and intensities of the first main visible or ultraviolet bands and other properties of 
complex unsaturated molecules, and preliminary application of the theory is made to ethylene 
and benzene. 

The theory is formulated in the language of the purely theoretical method of the antisym¬ 
metrized products of molecular orbitals (in LCAO approximation) including configuration 
interaction, but departs from this theory in several essential respects. First, atomic orbital 
integrals involving the core Hamiltonian are expressed in terms of quantities which may 
be regarded as semi-empirical. Second, an approximation of zero differential overlap is 
employed and an optionally uniformly charged sphere representation of atomic rr-orbitals 
is introduced, which greatly simplify the evaluation of electronic repulsion integrals and 
make applications to complex molecules containing heteroatoms relatively simple. Finally, 
although the theory starts from the rr-electron approximation, in which the unsaturated 
electrons are treated apart from the rest, provision is included for the adjustment of 
the cr-electrons to the rr-electron distribution in a way which does not complicate the 
mathematics. 

Once again you should have picked up many of the key phrases. We often speak of the 
PPP method in honour of its three originators. 


18.4 Zero Differential Overlap 

A key phrase that I have yet to explain is zero differential overlap (ZDO); the atomic orbital 
basis set is not rigorously defined but we can imagine it to comprise the relevant STO with 
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one atomic orbital per heavy atom. Hydrogen atoms do not enter into n -electron models. 
The basis functions are taken to be orthonormal, so we have 


/ 


Xt ( r )Xj M dr 


1 iff = j 
0 otherwise 


ZDO extends this idea to the two-electron integrals; if we have a two-electron integral of 
the type 


-J-i 


Xt (ri )Xj ( r i) dr. 


then the integral is set to zero unless i = j. So all two-electron integrals 


e 2 r 1 

-- / Xt (Ti )Xj (ri) —Xk (r 2 ) Xi (r 2 ) dr, dr 2 

47re 0 J r l2 


(18.3) 


are zero unless i = j and k = l. 

At first sight the basis functions are STO 2p„, and the remaining integrals can actually 
be calculated exactly. When Pariser and Parr first tried to calculate the ionization energies 
and spectra of simple conjugated hydrocarbons such as benzene with exact two-electron 
integrals, they got very poor agreement with experiment. They therefore proposed that the 
two-electron integrals should be treated as parameters to be calibrated against spectroscopic 
data. Once again, you have to remember that computers were still in their infancy and that 
any simple formula was good news. We had electromechanical calculators and log tables 
in those days, but nothing much else. One such popular formula was 


f Xi Oh )Xi Oh) — Xk Oh) Xt Oh) dt! dr 2 = - 


(18.4) 


where R ik is the separation between the orbital centres. All other two-electron integrals 
were taken as zero and the a ik term has to be fixed by appeal to spectroscopic experiment. 
The expression is what you would get for the mutual potential energy of a pair of electrons 
separated by distance R ik + a ik , and these repulsion integrals are traditionally given a 


Next we have to address the one-electron terms, and the treatment depends on whether 
the term is diagonal (i = j ) or off-diagonal (i ^ j). In standard HF-LCAO theory the 
one-electron terms are given by 

where the first term represents the kinetic energy of the electron and the second term gives 
the mutual potential energy between the electron and each nucleus. 

In Pariser-Parr-Pople theory, the off-diagonal elements are taken as zero unless the atom 
pairs are directly bonded. If the atom pairs are directly bonded the matrix element is given 
a constant value fi for each particular atom pair, which is certainly not the same value 
as the fi value in ordinary Hiickel theory. The value of either /3 is found by calibration 
against experiment. The one-electron diagonal terms are written so as to separate out the 
contribution from the nuclear centre I on which the atomic orbital is based (usually taken 
as minus the valence state ionization energy co r ) and the other nuclear centres J (usually 
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taken as -ZjYij, where Z is the formal number of electrons that an atom contributes to the 
7 x system). There is no problem with hydrocarbons in that all carbon atoms are assigned 
the same valence state ionization energies, irrespective of their chemical environment, but 
many papers were written to justify one particular choice of w t against another for nitrogen. 

For the record, the PPP HF-LCAO Hamiltonian h F for hydrocarbons and other first-row 
systems has dimension equal to the number of conjugated atoms and can be written 


K = «H + \PnYu + £ {r n ~ Zj) YiJ 


(18.6) 


K = h- 2 p m 


Here P is the usual matrix of charges and bond orders. The HF-LCAO equations are solved 
by the usual techniques. 

In those early days, no one dreamed of geometry optimization; benzene rings were 
assumed to be regular hexagons with equal C-C bond length of 140 pm. PPP calculations 
were state of the art in the early 1970s and consumed many hours of early computer time. 
Once again there was an argument as to the role of d orbitals on sulfur, and a number of 
long-forgotten schemes and methodologies were advanced. 


18.5 Which Basis Functions Are They? 

I seem to have made two contradictory statements about the basis functions used in semiem¬ 
pirical work. On the one hand, I have said that they are orthonormal and so their overlap 
matrix is the unit matrix; on the other hand, I have used overlap integrals to calculate certain 
integrals. 

Think of a Hiickel 7T-electron treatment of ethene, and call the carbon 2p T orbitals Xi 
and Xi■ The matrix of overlap integrals is 



where p is the overlap integral of the two atomic (STO) orbitals in question. The eigenvalues 
of this matrix are 1 ± p and the normalized eigenvectors are 



and 



A little matrix algebra will show that 


S = (1 +p)v 1 v[ + (1 -p)\ 2 vl 


Mathematicians have a rather grand expression for this: they talk about the spectral decom¬ 
position of a matrix. We can make use of the expression to calculate powers of matrices, 
such as the negative square root 
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s -l/2 = (i + py 1 ' 2 v,v; + (i - p )- 1 ' 2 v 2 \ t 2 


To cut a long story short, we regard the basis functions used in semiempirical calculations 
as related to ordinary STOs Xu Xi,- ■ ■ Xn by the matrix transformation 


(Xl *2 ••• Xn) S' 172 


(18.7) 


These have the property that they are orthonormal, yet resemble the ordinary STOs 
closely as possible, in a least-squares sense. 


18.6 All Valence Electron ZDO Models 

The early 7r-electron semiempirical models proved a great success, and they are still some¬ 
times encountered in current publications. Attempts to extend them to the a systems or 
to inorganic molecules met with mixed fortune for a variety of reasons, especially the 
following three. 

• If we draw a molecule and then arbitrarily choose an axis system, physical properties 
such as the energy should not depend on the choice of axis system. We speak of rotational 
invariance; the answers should be the same if we rotate a local molecular axis system. 

• Also, despite what one reads in elementary organic texts, calculated physical properties 
ought to be the same whether one works with ordinary atomic orbitals or the mixtures 
we call hybrids. 

• Finally, we should get the same answers if we work with symmetry-adapted combinations 
of atomic orbitals rather than the ‘raw’ atomic orbitals. Whatever the outcome, we will 
not get a different energy. 


18.7 CNDO 

Pople and Segal (1965) seem to be the first authors to have addressed these problems. 
The most elementary theory that retains all the main features is the complete neglect of 
differential overlap (CNDO) model. The first paper dealt mainly with hydrocarbons, and 
only the valence electrons were treated. The inner shells contribute to the core that modifies 
the one-electron terms in the HF-LCAO Hamiltonian. The ZDO approximation is applied 
to all two-electron integrals so that 



(18.8) 


is zero unless i = j and k — l. Suppose now that atoms A and B are both carbon, and 
so we take 2s, 2p x , 2p y and 2p z basis functions for either atom. In addition to the ZDO 
approximation, the CNDO model requires that all remaining two-electron integrals of type 
(18.8) involving basis functions on A and B are equal to a common value denoted y AB . 
Parameter y AB depends on the nature of the atoms A and B but not on the details of the 
atomic orbitals. This simplification guarantees rotational invariance. 
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Similar considerations apply to the one-electron terms 



When i = j and the basis function Xt is centred on nucleus A we write the integral 



and determine U from atomic spectral data. The remaining terms in the sum are called 
penetration integrals and written V KR . 

If i / j and the basis functions are on different atomic centres then all three-centre 
contributions are ignored. The remaining two-centre terms involving atoms A and B are 
written where S is the overlap integral and ft a ‘bonding’ parameter. The bonding 

parameter is nonzero only for bonded pairs. 

Collecting up terms and simplifying we find that a CNDO HF-LCAO Hamiltonian has 
elements 



hi = P° AB S,j ~ ^PuYab 


(18.9) 


A and B label atoms, i and j label the basis functions and P AA is the sum of the diagonal 
charge density matrix for those basis functions centred on atom A. 

The original parameter scheme was called CNDO/1. Electron repulsion integrals were 
calculated exactly, on the assumption that the basis functions were s-type STOs, and all 
overlap integrals were calculated exactly. The bonding parameters /3 AB were chosen by 
comparison with (crude) ab initio calculations on relevant small molecules, and a simple 
additivity scheme was employed: 


P° A b = P°a + Pb 


18.8 CNDO/2 

It turned out that CNDO/1 calculations gave poor predictions of molecular geometries, and 
this failing was analysed as due to the approximations made for U u and the penetration 
term lArs- These problems were corrected in CNDO/2; Tab is no longer calculated exactly; 
rather it is taken as -Z B y AB . The atomic terms become 



(18.10) 
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where /, and E, are valence state ionization energies and electron affinities. The HF-LCAO 
matrix elements are 


= Vi + Ed + ((Paa ~Za)~\ (P U ~ 1)) Yaa + J2 (*Wab - Vab) 

B^ A ( lt 

h l = Pab S H ~ \PjjY*a 


18.9 CNDO/S 


One faihng of the jr-electron models was their inability to treat n —*■ it* electronic trans¬ 
itions. The CNDO models gave a great breakthrough because they afforded for the first time 
a systematic treatment of both jr —» it* and n —»■ it* transitions in conjugated molecules. 
The treatment essentially followed the lines of CIS discussed in Chapter 17 but the first res¬ 
ults were in poor agreement with experiment in that they gave excitation energies that were 
much too large. The most significant improvement to CNDO for use in spectroscopy was 
given by the work of Del Bene and Jaffe (1968). Their major contribution was to modify 
the bonding integrals; think of a pair of bonded carbon atoms, with one pair of 2p orbitals 
pointing along the bond fa) and the remaining two pairs perpendicular to the bond fit). 
Del Bene and Jaffe introduced a new parameter k such that the 7r-type interaction was 
reduced. They essentially replaced the term with 


KbS« = \(P°a + R)Si, 
PIb^ = \k {Pl + Pl)s tJ 


(18.12) 


They recommend a value k = 0.585, which gives best agreement with experiment for 
many conjugated molecules. 


18.10 INDO 

The CNDO models make an unnecessary and draconian simplification to the two-electron 
integrals 

JI Xi fri )Xj (D) yXk (r 2 ) Xi (D) dr, dr 2 = 

That is to say, even if orbitals x, and Xj are on the same centre but i ^ j then the entire 
integral is taken to be zero. Direct calculation shows that the integrals can be far from zero. 
In fact, it is just one-centre integrals of this kind that give the correct ordering between 
atomic spectroscopic states. CNDO cannot distinguish between the 3 P, ] D and 'S states 
arising from orbital configuration 


C : (ls) 2 (2s) 2 (2p) 2 
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Pople et al. (1967) introduced the intermediate neglect of differential overlap (INDO) 
scheme, which retained monocentric repulsion integrals. Some of these integrals are taken 
as parameters to be determined by comparison with experiment rather than by direct 
calculation. 


18.11 NDDO (Neglect of Diatomic Differential Overlap) 


The next level of approximation along these lines is the NDDO model, developed by Pople 
and Beveridge (1970). All two-electron integrals 


// 


Xi ( r i)Xj (*i) — X* (r 2 ) Xi (r 2 ) dr, dr 2 


are retained when Xt and xj are on the same centre and both Xk and Xi are on a single 
different centre. NDDO never really caught on; it was overtaken by events as described in 
the following. 


18.12 The MINDO Family 

The CNDO/2, INDO and NDDO methods are now little used apart from providing initial 
estimates of the electron density for use in ab initio HF-LCAO iterations. Their significance 
is that they were the first family of semiempirical models which retained increasing numbers 
of smaller repulsion integrals in a systematic way. Even NDDO neglects the majority of 
two-electron integrals for real molecules, and so the models made HF-LCAO calculations 
on large systems feasible. They did not give very accurate results, perhaps because they were 
parameterized by comparing with low accuracy ab initio calculations. They also needed 
reliable geometries as input, because geometry optimization was still in its infancy. Pople 
and Beveridge’s (1970) book contains a listing of a Fortran CNDO/INDO code, and it is 
interesting to note that there is no mention of geometry optimization. 

The MINDO (modified INDO) models developed by Michael J.S. Dewar had a quite 
different philosophy; their objective was to develop a genuine reliable model for experi¬ 
mentalists. They realized that parameters had to be available for a wide range of atoms and 
the models had to deliver results to chemical accuracy in reasonable amounts of computer 
time. Attention was paid to reliable geometry optimization, so enabling the program to 
accept relatively crude initial geometries for unknown structures. 


18.12.1 MINDO/3 

The synopsis for the landmark MINDO/3 paper by Bingham et al. (1975) is terse: 

The problems involved in attempts to develop quantitative treatments of organic chemistry are 
discussed. An improved version (MINDO/3) of the MINDO semiempirical SCF-MO treatment 
is described. Results obtained for a large number of molecules are summarised. 
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MINDO/3 uses an s, p minimal basis set of STOs and the elements of the HF-LCAO 
matrix are 


hi = U u + J2 ( p m ~ \ P ‘,h) + E - Zb)Kab 

j on A ^ ' B^A 

hi = — -Pijh-ij if Xi and Xj on A 
hi = /z“ re — ^P ( j/ab otherwise 


(18.13) 


I have written the atomic Coulomb and exchange integrals 


e 2 f 1 

Yu = -t- / Xi ( r i)x< (ri) —Xj ( r 2 ) Xj (r 2 ) dr, dr 2 

4rre 0 J r u 

e 2 f 1 

hj = -- / Xi ( r i)X/‘ (ri) —X, (r 2 ) X; (r 2 ) dr, dr 2 

4rre 0 J r l2 


for simphcity of notation. The parameters for MINDO/3 were obtained in an entirely dif¬ 
ferent way from the CNDO/INDO/NDDO family; many quantities like the STO exponents 
were allowed to vary during the fitting procedure. The bonding parameter /f° B was allowed 
to vary, and experimental data such as enthalpies of formation and accurate molecular 
geometries were also used to get the best fit. 

An interesting feature was the treatment of core-core repulsions (the core in this 
case being identified with the nucleus plus any inner-shell atomic electrons). The simple 
Coulomb term 


Cab 


e 2 Z A Z 
47T£ 0 R APj 


(where Z A and Z B are the ‘effective’ nuclear charges) was modified for various complicated 
reasons to make it a function of the electron repulsion integrals: 


u “= ZA - >'“) (-^)) (1814) 


Here ckab is a dimensionless constant and it depends on the natures of the atoms A and B. 
For O-H and N-H bonds a slightly different scheme was adopted: 


Uah = Z A Z H ^Kah + -Tab^ a ah exp (18.15) 


18.13 MNDO 

MINDO/3 proved very successful but it had a number of limitations; enthalpies of formation 
of conjugated molecules were generally too positive, bond angles were not well predicted, 
and so on. Dewar and Thiel (1977) introduced the modified neglect of differential overlap 
(MNDO) model, which they based on NDDO wh il st retaining the philosophy of MINDO/3. 
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The core-core repulsions were further modified: 

U AB =-^—Z A Z B f ■s A (r l ).s'A (r,)—.v B (r 2 ).y B (r 2 )dr, dr 2 
4tt£ 0 J r 12 

(l + exp(-^)+exp(-“^)) 
with a different formula for OH and NH bonds. 


(18.16) 


18.14 Austin Model 1 (AMI) 

Next came AMI, due to Dewar et al. (1985). AMI was designed to eliminate the problems 
from MNDO caused by a tendency to overestimate repulsions between atoms separated 
by the sum of their van der Waals radii. The strategy adopted was to modify the core¬ 
core terms by multiplication of the Coulomb term with sums of Gaussian functions. In 
the original AMI paper, there are four terms in the Gaussian expansion. Each Gaussian 
is characterized by its position along the A-B vector and by its width. This significantly 
increased the number of parameters for each atom. 


18.15 PM3 

PM3 is the third parameterization of MNDO, and the PM3 model contains essentially all 
the same terms as AMI. The parameters for PM3 were derived by Stewart (1989) in a more 
systematic way than those for AMI, many of which were derived by ‘chemical intuition’. 
As a consequence, some of the parameters are quite different from those of MNDO but the 
two models seem to predict physical properties to the same degree of accuracy. 


18.16 SAMI 

The final 1993 offering in Michael Dewar’s name was semi-ab-initio model 1 (SAMI) 
(Dewar et al. 1993). In SAMI, two-electron integrals are calculated using a standard STO- 
3G basis set (and hence the appearance of ab initio in the title). The resulting integrals were 
then scaled, and the Gaussian terms in the core-core repulsions were retained in order to 
fine-tune the calculations. 


18.17 ZINDO/1 and ZINDO/S 

Michael Zerner and co-workers developed these models for transition metal compounds 
(Zerner 1991). ZIN DO/1 is a variant of INDO and ZINDO/S is an INDO-like method used 
to calculate electronic spectroscopic properties of molecules. 
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18.18 Effective Core Potentials 


The ground state of the silver atom has electron configuration 

Ag : (1 s) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (3d) 10 (4s) 2 (4p) 6 (4d) 10 (5s) 1 


and much of its chemistry can be explained in terms of the outer 5s electron. The remaining 
46 electrons seem to form an inert core and this suggests that we might try to model the 
silver atom as a nucleus of charge Z = 41 and an inner shell of 46 electrons. This idea was 
tried in the very early days using potentials such as 


«v . exp (—2 kr) 

- -b A - 

47 re 0 r r 


(18.17) 


where n w is the number of valence electrons and A and k are constants that have to be 
determined by fitting an atomic property. Use of such a potential means that we have 
eliminated most of the two-electron integrals. 

The idea of dividing electrons into groups is quite common in chemistry; in CNDO 
theory we treat the valence electrons only, without explicitly treating the core electrons. In 
ZDO 7r-electron models we treat the 7T -electrons explicitly and assume that the effect of the 
o -electrons can be taken into account in the parameter choice. There is a serious problem 
that I can explain by reference to silver. The 5 s orbital has to be orthogonal to all the inner 
orbitals, even though we do not use them (and do not necessarily want to calculate them). 

Many ab initio packages allow the use of effective core potentials (ECPs), which replace 
the atomic cores in valence-only ab initio calculations; traditionally they were represented 
as linear combinations of functions of the type 


r " exp (-«r 2 ) 


with coefficients and exponents determined by fitting the potential generated from accurate 
HF-LCAO wavefunctions. In recent years it has become fashionable to represent the core 
potentials by one- and two-term Gaussians obtained directly from the appropriate atomic 
eigenvalue equation. 


18.19 Problem Set 

A set of problems, together with solutions, is available on the associated website. 
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Electron Correlation 


I have mentioned electron correlation at several points in the text, and I gave an operational 
definition of correlation energy as the difference between the exact HF energy and the true 
experimental energy. There is a great deal of small print; in general we cannot obtain exact 
HF wavefunctions for molecules, only LCAO approximations. The Schrodinger treatment 
takes no account of the theory of relativity, whilst we know from simple atomic spectra 
that relativistic effects are non-negligible. We have to be careful to treat zero-point energy 
in a consistent manner when dealing with vibrations, and so on. 


19.1 Electron Density Functions 

I have put a great deal of emphasis on the electron density. The wavefunction for a molecule 
with n electrons will depend on the 3 n spatial electron coordinates r 1; r 2 , . . . , r„ together 
with the n spin variables s, (a or /f). Many authors combine space and spin variables into 
a composite x = rs and so we would write a wavefunction 

tf r (x 1 ,x 2 , ...,x„) 

According to the Born interpretation, 


9* (x!,x 2 ,... ,x„) & (x 1; x 2 ,... ,x n ) dti d«i dr 2 ds 2 ... dr„d^„ 


gives the probability of finding simultaneously electron 1 in spatial volume element dri 
with spin between s 1 and v, + d.v,, electron 2 in spatial volume element dr 2 with spin 
between s 2 and s 2 + ds 2 , . . . , electron n in spatial volume element dr„ with spin between 
s„ and s„ + d.v„. 
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As we noted earlier, this expression contains too much information and we ask instead 
about the probability that any given element will be occupied by one electron with the other 
electrons anywhere. This is often called the one-electron density function 

Pi (xO = n J <P* (x 1; x 2 ,... ,x„) 'I' (x l5 x 2 ,... ,x„)dr 2 di 2 • • • dr„ ds„ (19.1) 

The X! on the left refers to ‘point 1’ at which the density is to be evaluated rather than the 
coordinates of electron 1, the indistinguishability of the electrons being accounted for by 
the factor n. 

If we want to know the probability of finding an electron of either spin in the spatial 
element dri then we integrate over d.y, to give the charge density discussed in Chapter 16 
and measured by crystallographers: 

Pi (rj| = n J (Xi,x 2 ,..., x„) & (x 3 , x 2 ,..., xjdsi dr 2 ds 2 • • • dr„ di„ (19.2) 

It is written either P or P,; I have used P in earlier chapters. It also proves desirable to 
introduce probabilities for finding different configurations of any number of particles. Thus 
the two-electron (or pair) density function 

p 2 (xi, x 2 ) = n (n — 1) J <P* (x l5 x 2 ,..., x„) «P (x 1; x 2 ,..., x„)dr 3 ds 3 ... dr„ ds„ (19.3) 

determines the probability of two electrons being found simultaneously at points Xi and 
x 2 , spins included, whilst 

P 2 (x u x 2 ) = n(n — 1) J <P* (xi,x 2 ,...,x„) *P (x,,x 2 ,...,x„)ds , i ds 2 dr 3 d.s 3 ... dr„ d,s„ 

(19.4) 

determines the probability of finding them at points r 3 and r 2 in ordinary space, irrespective 
of spin. 

Many common one-electron properties depend only on Pi and since the Schrodinger 
equation only contains pair interactions we need not consider distributions higher than the 
pair functions. For a state of definite spin, as distinct from a mixed state, the one-electron 
density function has the form 


Pl ( Xl ) = P“ (r.) « 2 (a) + Pf (A) P 2 (a) (19.5) 

where the P’s are spatial functions. There are no cross terms involving both a and fi. 
In orbital models, the P’s are just sums over the squares of the occupied orbitals. The 
two-electron density function is also given by 


Pi (xi, x 2 ) = P““ (r 1 ; r 2 ) a 2 (ij) a 2 (s 2 ) + Pf (r 1; r 2 ) a 2 {s x ) f 2 fe) 
+Pf (ri,r 2 ) p 2 (si) a 2 fe) + Pf (r,,r 2 ) p 2 (sf) p 2 (s 2 ) 


(19.6) 
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19.1.1 Fermi Correlation 

The two-electron functions tell us how the two electrons are correlated. In the one- 
determinant HF model, we find that the P’s are related as follows: 

P? fa, r 2 ) = P\ (n) P\ (r 2 ) - X (r lt r 2 ) 

*<*.*>-*r<w*<w (197) 

where X is a function whose form need not concern us, except to note that it exactly 
cancels the first term when r, = r 2 . From these results we get a clear picture of the electron 
correlation shown by standard closed-shell HF theory. The form of P^ shows that there 
is no correlation between electrons of opposite spin, since the simultaneous probability is 
just the product of the individual ones. This is a defect of HF theory. Electrons of like spin 
are clearly correlated and they are never found at the same point in space, and HF theory 
is satisfactory here. This type of correlation arises from antisymmetry and applies to all 
fermions. It is usually called Fermi correlation. 


19.2 Configuration Interaction 

I have mentioned configuration interaction (Cl) at various points in the text, in the dihydro¬ 
gen discussion of Chapter 15 and in the CIS treatment of excited states in Chapter 17. The 
idea of modem Cl is to start with a reference wavefunction that could be a HF-LCAO closed 
shell wavefunction, and systematically replace the occupied spinorbitals with virtual ones. 
So if A, B,... represent occupied orbitals and X, Y,... represent virtual ones we would 
seek to write 

IfrCI _ ^hf + clV* + + ■■■ (19.8) 

The expansion coefficients can be found by solving the variational problem, in the usual 
way. This involves finding the eigenvalues and eigenvectors of the Hamiltonian matrix; 
usually one is only interested in a few of the electronic states, and methods are available 
for finding just a small number of eigenvalues and eigenvectors of very large matrices. 

In a complete Cl calculation we would include every possible excited configuration 
formed by promoting electrons from the occupied spinorbitals to the virtual ones; for a 
closed shell singlet state molecule with m electron pairs and n basis functions there are 

_n! ( n + 1)!_ 

in ! (m + 1)! (n -m)\{n-m+ 1)! 

possible Slater determinants. If we consider a HF/6-311G** wavefunction for benzene, we 
have m — 21 and n = 144, giving approximately 5 x 10 50 possibilities. We therefore must 
truncate the Cl expansion. 

In the CIS model we just take singly excited states. If the reference wavefunction is a 
closed-shall HF one, the single excitations do not mix with the ground state and CIS tends 
to be used as a low-level method for studying excited states. 

The next logical step is to include the double excitations. If this is done together with 
the single excitations we have the CISD {Cl singles and doubles ) model. If the double 
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excitations alone are included we have CID (Cl doubles). Such truncations can lead to a 
problem called size consistency that I can illustrate by considering a very simple problem, 
that of two neon atoms. Table 19.1 shows a CISD/6-311G* calculation for dineon at 5000 pm 
separation, together with the free atom energy. 


Table 19.1 Dineon calculations, 6-311C* basis set 


Method 

Dineon energy/F h at 5000 pm 

Atom energy/£ h 

HF-LCAO 

-257.0451061 

-128.52255305 

CISD 

-257.4466147 

-128.7283956 


The HF-LCAO energy of a pair of neon atoms at large separation is exactly twice the 
free atom value, but this is not the case for the CISD calculation. If we have an ensemble of 
n particles and their energy is n times the energy of a single particle, we say that the theory 
scales correctly (or that the method is size consistent). Full Cl calculations scale correctly, 
but truncated Cl expansions do not. 

After double excitations, quadruple excitations are next in order of importance. If singles, 
doubles, triples and quadruples are included the acronym becomes CISDTQ. 


19.3 Coupled Cluster Method 

The coupled cluster (CC) method was first used by physicists studying nuclear struc¬ 
ture. Bartlett’s (1989) review is fairly recent. The fundamental equation relates a HF 
wavefunction dr™ to the best possible wavefunction by 

V = exp (r) (19.9) 

The exponential operator is defined by a Taylor series expansion 

r (1910) 

and the cluster operator is defined as 

f = f 1 + f 2 + -- + T n (19.11) 

where n is the number of electrons in the molecule. The operators have the effect of replacing 
occupied spin orbitals in dr™ with virtual spin orbitals. T, performs all singly excited 
substitutions, T 2 performs all doubly excited configurations and so on until all n electrons 
have been promoted from filled to virtual spinorbitals. The effect of the exponential operator 
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on the HF wavefunction is to express it as a linear combination that contains <f fHF and all 
possible excitations of electrons from occupied to virtual spinorbitals. We write 

f 1 ^ HF = 

T 2 'P vf = c^<^ Y 

A,B,X,Y 

and so on, and the aim of the theory is to determine the numerical coefficients. It is usual 
to approximate the T operator by including only some of the terms, and it is generally 
accepted that the most important contribution is T 2 . This gives the coupled cluster doubles 
(CCD) method. Since we take 

exp (f) « exp (f 2 ) = 1 + T 2 + jTl + ■■■ (19.12) 

the resulting wavefunction contains the HF wavefunction together with double quadruple, 
hextuple, etc. substitutions. The CCD method is size consistent. 

The next step is to include the 7\ operator and so take 

exp (f) * exp (f, + %) = 1 + (f, + ti) + 2 (f, + Ji)“ + • • • (19.13) 

which gives CCSD. Inclusion of T 3 gives CCSDT in an obvious notation. 

Pople and co-workers have developed a quadratic Cl method, which lies between Cl 
and CC. This QCI method exists in size-consistent forms QCISD and QCISDT (Pople 
etaL 1987). 


19.4 M0lIer-Plesset Perturbation Theory 

The HF-LCAO model is thought to be satisfactory for treating many molecular properties 
near minima on the molecular potential energy surface. It cannot generally treat covalent 
bond breaking and making, but forms a useful starting point for the more advanced theories. 
The HF model averages over electron repulsions; each electron experiences an average 
field due to the remaining electrons plus the field due to the nuclei. In real life there are 
‘instantaneous’ electron repulsions that go as 

e 2 1 
47T£o Tjj 

A great advantage of the HF-LCAO model is that the equations can be solved to arbitrary 
precision, once a basis set has been defined. We therefore have a ‘zero-order’ problem 
with an exact solution, together with a perturbation. We discussed perturbation theory in 
Chapter 14 when dealing with the helium atom; the idea is to write the true Hamiltonian as 
that for a soluble, zero-order problem plus a perturbation 

H = H (0) + XH m 

The difference from Chapter 14 is that we now use a HF-LCAO wave function as the zero- 
order wave function and treat the difference between the true electron-electron repulsions 
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and the averaged ones from HF theory as the perturbation. This gives us Mpller-Plesset 
perturbation theory. The method had been in the literature since 1934, but its potential 
for molecular calculations was not appreciated until the 1970s. It is of immense historical 
interest because it gave us the first realistic route for a treatment of small-molecule correl¬ 
ation energy, and you ought to read the original paper. The synopsis is reproduced below 
(Mpller and Plesset 1934). 

A Perturbation Theory is developed for treating a system of n electrons in which the Hartree 
Fock solution appears as the zero-order approximation. It is shown by this development that 
the first order correction for the energy and the charge density of the system is zero. The 
expression for the second order correction for the energy greatly simplifies because of the 
special property of the zero order solution. It is pointed out that the development of the higher 
order approximation involves only calculations based on a definite one-body problem. 

The HF model averages over electron repulsions, and the HF pseudo-one-electron 
operator for each electron has the form (from Chapters 14 and 16) 

h F (r i ) = h m (r,) + J (r,) - '-k (r,) 

The unperturbed Hamiltonian is taken as the sum of the HF operators for each of the n 
electrons 


H (0) = fr) 


m iz (^ (1) to) + j to) - (r ; )) (19.1 


whilst the true Hamiltonian makes no reference to averaging: 




The first few key equations of the perturbation expansion, taken from Chapter 14 but 
simplified for our present discussion, are shown below. The electronic state of interest is the 
ground state, denoted </ /(0) , with energy e (0) . I have therefore dropped the (i) subscripts, since 
we are dealing with just the one electronic state. Promoting electrons from the occupied to 
the virtual HF spinorbitals gives the excited states, and we consider singe, double, triple, 
etc. excitations in the usual fashion of Cl: 

0)^(0) _ £ (0)^/(0) 

- £ <°>) = (>) - (1915) 

0) _ e (0)^ ip-(2) = e (2)^(0) _|_ gO)jfr(i) _ 

Since the zero-order Hamiltonian is a sum of HF operators, the zero-order energy is a 
sum of orbital energies 

e (0) = E £ ' 


( 19 . 16 ) 
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where the sum runs over the occupied spinorbitals. The first-order energy is 

e 0) = J V i0) H (l) V m dT (19.17) 

and adding e (1) to e (0) gives the full HF energy g HF . We therefore need to progress to second- 
order theory in order to give any treatment of electron correlation. The levels of theory are 
denoted MP2, MP3,..., MPn, where n is the order of perturbation theory. 

The second-order energy is given by 

I f^(0)^a)m dT f 

s« = y ---L (19.18) 

L -' £<°> - Sj 


and the first-order correction to the wavefunction is 

... dr 

md) — V l _ i _t j/ 

g(°) — Sj 1 


(19.19) 


where the sum runs over all excited states, written «/>,, energies Sj. The ground state *^ (0) is 
a HF wavefunction and so the integral vanishes for all singly excited states because of the 
Brillouin theorem. It is also zero when the excited state differs from ‘ I' {0) by more than two 
spin orbitals, by the Slater-Condon-Shortley rules. Hence we only need consider doubly 
excited states in order to find the second order energy. The e <2) numerator is nonnegative 
since it is the square of a modulus. The denominator is negative because *P (0) refers to the 
ground state andto excited states. The second-order energy correction is therefore always 
negative or zero. Higher orders of perturbation theory may give corrections of either sign. 

The MP1 energy is therefore identical to the HF energy and MP2 is the simplest prac¬ 
ticable perturbation treatment for electron correlation. It includes only the effect of double 
excitations. 

By similar considerations, the third, fourth and higher orders of perturbation theory can 
be determined. The terms rapidly become algebraically complicated and the higher orders 
are increasingly costly to apply. For n basis functions, HF-LCAO theory scales as « 4 , MP2 
as n 5 , MP3 as n 6 and MP4 as n 1 . These should be seen as theoretical upper bounds, since 
sophisticated use of symmetry and integral cut-offs mean that practical calculations need 
considerably less resource. 

A great simplifying feature of MP2 is that a full four-index transformation is not neces¬ 
sary, all we have to do is to semitransform the two-electron integrals and this leads to an 
immense saving in time compared to conventional Cl treatments. MP3 also includes only 
double excitations, whilst MP4 includes a description of single excitations together with 
some triples and quadruples. The triple contributions in MP4 are the most expensive. In the 
MP4(SDQ) variation we just include the least expensive singles, doubles and quadruple 
excitations 

Analytical energy gradients have been determined for MP2, MP3 and MP4 making for 
very effective geometry searching. I can illustrate some points of interest by considering 
two examples. First is the geometry optimization of ethane, shown in Table 19.2.1 ran all 
calculations from the same starting point on the molecular potential energy surface. The 
HF/STO-3G calculation would these days be regarded as woefully inadequate. 
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Table 19.2 Geometry optimization for ethene 



HF/STO-3C 

HF/6-311G** 

MP2/6-311G** 

MP2/cc-pVTZ 

®hf /Eh 

-77.0789547 

-78.0547244 

-78.0539661 

-78.0638977 

®MP2/ Eh 



-78.3442916 

-78.3993063 

£ corr / Eh 



0.2903255 

0.3354086 

Reel pm 

130.6 

131.7 

133.7 

133.2 

/? CH /pm 

108.2 

107.6 

108.5 

108.0 

HCH/° 

115.7 

116.7 

117.2 

117.3 

Cpu time/s 

28 

41 

70 

881 


There are two sources of error in the calculations: the choice of a finite basis set, and 
truncation of the MPn series at MP2. According to the literature, the basis set error is the 
larger of the two. One has to be careful to compare like with like and the results of the 
calculations should be compared with geometric parameters at the bottom of the potential 
energy well. Experimental values very often refer to the lowest vibrational energy level. The 
estimate of the correlation energy depends markedly on the quality of the basis set. Electron 
correlation studies demand basis sets that are capable of high accuracy, and a number of 
suitable basis sets are available in the literature. One popular family is the correlation 
consistent basis sets of Dunning, which go by acronyms such as cc-pV6Z. This contains 
seven s-type, six p-type, four d-type, two g-type and one h-type primitives. 

We usually find that MP2 calculations give a small but significant correction to the 
corresponding HF-LCAO geometries. It is probably more worthwhile using MPrc models 
to investigate effects that depend crucially on electron correlation; bond breaking and bond 
making phenomena spring to mind. Figure 19.1 shows representative potential energy 
curves for dihydrogen, following the discussion of Chapter 15. 



Distance/bohr 


Figure 19.1 Dihydrogen electron correlation 
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A 6-311G** basis set was used. The top curve is the HF-LCAO calculation, and shows 
the usual bad dissociation limit. The remaining three curves are (in descending order) MP2, 
MP3 and MP4(SDQ). The first point to notice is that MPra calculations are size consistent. 
The second point is that none of the curves appear to be approaching the correct dissociation 
limit of — \E h . It is reported in the literature that MPn calculations do not work particularly 
well for geometries far from equilibrium. The MPn series has not been proved to converge 
to the true energy, but calculations using very high orders (MP48) have suggested that 
convergence will eventually occur. 

Finally, MPrc calculations need a HF-LCAO wavefunction to act as the zero-order approx¬ 
imation. It follows that MPn can only be used for those molecules and electronic states where 
it is possible to find such a wavefunction. This rules out most excited states. 


19.5 Multiconfiguration SCF 

One problem with traditional Cl calculations is that excitations are to the virtual orbitals; the 
occupied orbitals are usually determined variationally by the HF-LCAO procedure whilst 
the virtual orbitals give poor representations of excited states. One solution to the problem 
is to write the ground-state wavefunction as a linear combination of Slater determinants 
and then use the variation procedure to optimize both the linear expansion coefficients and 
the HF-LCAO coefficients. That is, we write 

ip = ^ Cj ip. 

where t P ur is the Hartree-Fock wavefunction and an excited state, in the usual sense 
of Cl calculations. In certain cases the ratio of some of the Cj will be fixed by symmetry. 
The MCSCF model is regarded as highly specialized, with a small number of adherents. 
A literature search revealed an average of 50 papers per annum in primary journals over 
the last 10 years. 
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Density Functional Theory and the 
Kohn-Sham LCAO Equations 


The 1998 Nobel Prize for Chemistry was awarded jointly to John A. Pople and Walter 
Kohn, and the texts of their addresses on the occasion of the awards are reported in Pople 
(1999) and Kohn (1999). Pople’s name should be familiar to you if you have assiduously 
studied this text. Kohn opened his address with the following remarks. 

The citation for my share of the 1998 Nobel Prize in Chemistry refers to the ‘Development 
of Density Functional Theory’. The initial work on Density Functional Theory (DFT) was 
reported in two publications: the first with Pierre Hohenberg (P Hohenberg and W Kohn, 
Phys Rev 136 (1964) B864) and the next with Lu J Sham (W Kohn and L J Sham, Phys Rev 
140 (1965) A1133). This was almost 40 years after E Schrodinger (1926) published his first 
epoch-making paper marking the beginning of wave mechanics. The Thomas-Fermi theory, 
the most rudimentary form of DFT, was put forward shortly afterwards (E Fermi, Attl Accad 
Naz Lincei, Cl Sci Fis Mat Nat Rend 6 (1927) 602, L H Thomas, Proc Camb Phil Soc 23 
(1927) 542) and received only modest attention. 

It is worth spending a little time on the historical aspects before launching into modern 
DFT, and the exchange potential plays a key role in our discussion. If we consider a closed 
shell system with electron configuration 


(VO) 2 iir 2 f ... (tM 2 


then in Hartree’s original theory we would write the total wavefunction as a simple orbital 
product for which the electronic energy is 



( 20 . 1 ) 
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In Hartree-Fock (HF) theory we write the wavefunction as a Slater determinant and this 
gives an additional exchange term in the electronic energy 


£ ='= 2 E/^ 


(r i)h (r t ) fa (ri) d^ 


F XJ It 2 / f +* (Fl) ^ (Fl) $ (r >’ ri)lfo(ri) *s (ri) dr,dr 2 (20.2) 

-WJ fa (ri) fa (ri) g (ri.ri) fa (ri) fa (ri) dridr 2 


These expressions are general to Hartree and HF theory in that they do not depend on the 
LCAO approximation. In fact the Hartrees produced highly accurate numerical tables of 
radial functions with no LCAO coefficients in sight. The atomic problem is quite different 
from the molecular one because of the high symmetry of atoms and the HF limit can be 
easily reached by numerical integration. It is not necessary to invoke the LCAO approach. 
There has been a resurgence of interest in atomic HF calculations because astrophysicists 
want to study highly ionized species in the interstellar medium. They look to theory for 
their energy-level data because the species are hard to prepare and study in the laboratory. 

I have also described the LCAO versions, where the energy expressions are written in 
terms of the ‘charges and bond orders’ matrix P together with the matrices h (1) , J and K: 

e el , H = 7>(Ph,) + Vr(PJ) (20.3) 

for the Hartree model and 

£ CI .hf = Tr (Ph,) + ^Tr (PJ) - ^Tr (PK) (20.4) 

for the HF version. The J and K matrices depend on P in a complicated way, but the 
Hartree and HF electronic energies are completely determined from knowledge of the 
electron density P. 

There was initially a great deal of confusion about the extra term in the HF energy 
compared to the Hartree energy. The name ‘exchange term’ was coined and some authors 
tried to describe it in terms of a mysterious force called the ‘exchange potential’. 

In the meantime, solid-state physics had been developing along a quite different direction. 
Wigner and Seitz (1934a,b) suggested what is now called the ‘cellular method’ for handling 
the problem of computing crystal orbitals. These orbitals have energies that form continuous 
bands and such models are therefore known as energy band theories. 


20.1 Pauli and Thomas-Fermi Models 

There is nothing sinister about the exchange term; it arises because of the fermion nature 
of electrons. Nevertheless, it caused considerable confusion among early workers in the 
field of molecular structure theory and much effort was spent in finding effective model 
potentials that could mimic electron exchange. I discussed the earliest models of metallic 
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conductors in Chapter 12. The Pauli model is the simplest one to take account of the 
quantum mechanical nature of the electrons: the electrons exist in a three-dimensional 
infinite potential well, the wavefunction obeys the Pauli principle and at 0 K the pairs of 
electrons occupy all orbitals having energy less than or equal to energy e F (which of course 
defines the Fermi energy). The number N of conduction electrons can be related to e F and 
we find 



(20.5) 


Now NIL 3 is the number density of conduction electrons and so Pauli’s model gives a 
simple relationship between the Fermi energy and the number density of electrons. Physi¬ 
cists very often use the symbol n for the number density; it can vary at points in space and 
so we write n{ r) but in this simple model, the number density is constant throughout the 
dimensions of the metallic box: 


_ N 

= ^( 2 m e ) 3/2 ( £F ) 3 

3 h 2 


( 20 . 6 ) 


We now switch on an external potential U tx t (r) that is slowly varying over the dimensions 
of the metallic conductor, making the number density inhomogeneous. This could be (for 
example) due to the set of metallic cations in such a conductor, or due to the nuclei in a 
molecule. A little analysis suggests that the number density at position r should be written 

n (r) = ^ (2m e ) 3/2 (e F - U m (r)) 3/2 (20.7) 

3 hr 

This Thomas-Fermi relation therefore relates the number density at points in space to 
the potential at points in space. The number density of electrons at a point in space is just 
the charge density P, (r) discussed in several previous chapters and so we can write in more 
familiar chemical language 


P, (r) = — (2m e ) 3/2 (e F - t/ ext (r)) 3/2 (20.8) 

3 h 2 

Thomas and Fermi suggested that such a statistical treatment would be appropriate for 
molecular systems where the number of electrons is ‘large’ and in the case of a molecule 
we formally identify the external potential as the electrostatic potential generated by the 
nuclei. The Thomas-Fermi approach replaces the problem of calculating an /V-electron 
wavefunction by that of calculating the electron density in three-dimensional position space. 

Dirac (1930) studied the effects of exchange interactions on the Thomas-Fermi model, 
and discovered that these could be modelled by an extra term 

U x (r) = C (P 1 (r)) 1/3 (20.9) 

where C is a constant. R. Gaspar is also credited with this result, which Slater (1951) 
rediscovered, but with a slightly different numerical coefficient of |C. The disagreement 


340 Molecular Modelling for Beginners 


between Dirac’s and Slater’s numerical coefficient seems to have been first resolved by 
Gaspar (1954), and authors began to write the local exchange potential as 

U Xa (r) = Col (F, (r)) 1/3 (20.10) 


where a could take values between 2/3 and 1. This is the so-called Xa model. The use 
of the symbol a here is not to be confused with the use of the same symbol for a spin 
eigenfunction. 

Slater then had the ingenious idea of writing the atomic HF eigenvalue equation 
h ¥ (r) i[f (r) = sir (r) 

(h m (r) + J (r) — l -K (r)^ f (r) = ef (r) 
as 

( h m (r) +J(r) + U Xa (r)) ir (r) = ef (r) (20.11) 

and such calculations are usually referred to as atomic Xa-HF. The resulting Xa orbitals 
differ from conventional HF orbitals in one major way, namely that Koopmans’ theorem 
is no longer valid for every orbital and so the orbital energies cannot generally be used to 
estimate ionization energies. Koopmans’ theorem now applies only to the highest occupied 
orbital. A key difference between standard HF theory and density functional calculations 
is the way we conceive the occupation number v of each orbital. In molecular HF theory, 
the occupation number is 2, 1 or 0 depending on whether a given spatial orbital is fully 
occupied by two electrons (one of either spin), singly occupied or a virtual orbital. For a 
system comprising very many electrons we focus on the statistical occupation of each orbital 
and the occupation number becomes a continuous variable having a value between 0 and 2. 
The relationship between the electronic energy e e i and the occupation number of orbital i is 



so that the ionization energy from a particular orbital i/r, is given by 


( 20 . 12 ) 


£el (V; = 0) - £ el (V; = 1) = 


l E ‘ 


dV; 


Notice, however, that the energy is determined by the charge density P( r). When the 
calculations are performed, the resulting orbitals closely resemble those from standard HF 
theory and people use them in much the same way. 

At the time, such calculations of ionization energies had an edge over the traditional HF 
variety because they take account of the electron relaxation that can accompany ionization. 


20.2 Hohenberg-Kohn Theorems 

The Thomas-Fermi and Xa approaches were constructed as approximations to the quantum 
mechanical problem of calculating the electronic properties of a system of interest. The 
density functional theory of Hohenberg and Kohn, to be discussed, is in principle an exact 
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theory. The first keynote paper contains two fundamental theorems. The first theorem is 
what mathematicians call an existence theorem-, it proves that the ground-state energy of any 
electronic system is determined by afunctional of the electron density. The theorem means 
that in principle we only need to know the electron density in three-dimensional space 
and not the full wavefunction in order to calculate any ground-state property including the 
energy. 

The term functional is mathematical jargon; a function is a mapping from one set of 
numbers to another (Figure 20.1) whilst a functional is a mapping from a set of functions 
to a set of numbers. 



Set of numbers 


Set of numbers 


Figure 20.1 A function 


The proof of the theorem is quite simple, but will not be reproduced here. The paper 
is so important that you should see at least the abstract (Hohenberg and Kohn 1964). The 
authors use square brackets [...] to denote functionals. 

This paper deals with the ground state of an interacting electron gas in an external potential 
v(r). It is proved that there exists a universal functional of the density, F[n(r )], independent 
of v(r), such that the expression E = f v(r)n(r)dr + F\n(r)} has as its minimum value the 
correct ground state energy associated with v(r). The functional F\n(r)] is then discussed for 
two situations: (1) n(r) — n 0 + n'(r), n'/n 0 « 1, and (2) n(r) — (p(r/r 0 ) with <p arbitrary 
and r 0 —»■ oo. In both cases F can be expressed entirely in terms of the correlation energy 
and linear and higher order electronic polarizabilities of a uniform electron gas. This approach 
also sheds some light on generalized Thomas-Fermi methods and their limitations. Some new 
extensions of these methods are presented. 

The second theorem gives a variation principle for the density functionals; it states (in 
chemical language) that 

£ei[F(r)]>e el [P 0 (r)] (20.13) 

where P 0 is the true density for the system and P any other density obeying 

J P (r) dr = J P 0 (r) dr = N (20.14) 

where N is the number of electrons. The difficulty is that the Hohenberg-Kohn theorems 
give us no clue as to the nature of the density functional, nor how to set about finding it. 
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20.3 Kohn-Sham (KS-LCAO) Equations 

The Kohn-Sham paper gives us a practical solution to the problem, based on HF-LCAO 
theory; again, it is such an important paper that you should have sight of the abstract that 
in any case is self-explanatory (Kohn and Sham 1965). 

From a theory of Hohenberg and Kohn, approximation methods for treating an inhomogeneous 
system of interacting electrons are developed. These methods are exact for systems of slowly 
varying or high density. For the ground state, they lead to self-consistent equations analogous 
to the Hartree and Flartree-Fock equations, respectively. In these equations the exchange and 
correlation portions of the chemical potential of a uniform electron gas appears as additional 
effective potentials. (The exchange portion of our effective potential differs from the due to 
Slater by a factor of 2/3.) Electronic systems at finite temperatures and in magnetic fields are 
also treated by similar methods. An appendix deals with a further correction for systems with 
short-wavelength density oscillations. 

The Kohn-Sham equations are modifications of the standard HF equations and we write 

(h m (r) + J(r) + U x c (r)) f (r) = sf (r) (20.15) 

where f/ xc (r) is the local exchange-correlation term that accounts for the exchange phe¬ 
nomenon and the dynamic correlation in the motions of the individual electrons. We speak of 
the KS-LCAO procedure. It is usual to split [/ xc (r) into an exchange term and a correlation 
term and treat each separately. 

(h m (r) +J(r) + Ux (r) + U c «) * {r) = ef (r) (20.16) 

The electronic energy is usually written in DFT applications as 

£ei [P] = ei [P] + Sj [P] + e x [p ] + £c [P] (20.17) 

where the square brackets denote a functional of the one-electron density P( r). The first 
term on the right-hand side gives the one-electron energy, the second term is the Coulomb 
contribution, the third term the exchange and the fourth term gives the correlation energy. 
We proceed along the usual HF-LCAO route; we choose a basis set and then all that is needed 
in principle is knowledge of the functional forms of U x ( r) and U c ( r). It is then a simple 
matter in principle to modify an existing standard HF-LCAO computer code to include 
these additional terms. Either component can be of two distinct types: local functionals that 
depend only on the electron density at a point in space and gradient-corrected functionals 
that depend both on the electron density and its gradient at that point. 

In their most general form, the exchange and correlation functionals will depend on the 
density of the a and of the fi electrons, and the gradients of these densities. Each energy 
term e c and e x will therefore be given by a volume integral 

e x = J f x (PfPf grad P a , grad / J ^)dr 
e c = f fc (P a ,P p , grad P“, grad P p ) dr 

J V ' (20.18) 


where / is an energy density. 
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20.4 Numerical Integration (Quadrature) 

Given the functionals and the electron density, we have to integrate over the space of the 
molecule to obtain the energy. These extra integrals cannot usually be done analytically and 
so a numerical integration has to be done each KS-LCAO cycle. What we do is to replace 
the integral by a sum over quadrature points, e.g. 

£x/c = J2 H w a if {P%Pl grad (/>“), grad (/>“) ; r Ai ) (20.19) 


where the first summation is over the atoms and the second is over the numerical quadrature 
grid points. The w Ai are the quadrature weights and the grid points are given by the sum of 
the position of nucleus A and a suitable one-centre grid: 


The keynote paper for numerical integration is that due to Becke (1988a), and once again 
the paper is so important that you should see the abstract. 


We propose a simple scheme for decomposition of molecular functions into single-center 
components. The problem of three-dimensional integration in molecular systems thus reduces 
to a sum of one-center, atomic-like integrations, which are treated using standard numerical 
techniques in spherical polar coordinates. The resulting method is tested on representative 
diatomic and polyatomic systems for which we obtain five- or six-figure accuracy using a few 
thousand integration points per atom. 


Becke’s scheme rigorously separates the molecular integral into atomic contributions that 
may be treated by standard single-centre techniques. Numerical integration grids are usually 
specified as a specific number of radial shells around each atom, each of which contains a 
set number of integration points. For example, Gaussian 03 uses a grid designated (75,302) 
which has 75 radial shells per atom, each containing 302 points giving a total of 22650 
integration points. 

Over the years, a large number of exchange and correlation functionals have been 
proposed. 


20.5 Practical Details 

There are two versions of DFT, one for closed shell systems analogous to the Roothaan 
closed shell treatment, and one for the open shell case analogous to the UHF technique 
and we implement the Kohn-Sham LCAO (KS-LCAO) equations by including the relevant 
exchange-correlation term(s) instead of the traditional HF exchange term in the FIF-LCAO 
equations. That is, for the UHF case we write 

(h m (r) + J (r) + t>“ (r) + W c (r)) (r) = (r) 
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for the a-spin electrons and a similar expression for the /3-spin electrons. The elements of 
the a-spin exchange-correlation matrices are related to the energy density/ and the basis 
functions X by 

(t/x/c) u = / { J^XtXj + [ 2 ^grad + ^grad P> j .grad { XiXj ) J dr (20.20) 
where I have written 


Y ali = grad (P a ) grad (P p ) 

The /-spin electrons have a similar matrix. 

So for example, the Slater Xa exchange density is 

q / o \ 1/3 

/ (r) = --a* (-) (V (r)) 4/3 + (P» (r)) 4/3 ) (20.21) 


where a* is the Xa parameter, not to be confused with the spin variable. The derivatives 
are simple: 


dP“ 



( 20 . 22 ) 


with a similar expression for the /-spin term. 

By itself, the Slater Xa local term is not adequate for describing molecular systems. 
Becke (1988b) formulated the following gradient-corrected exchange functional: 


f = (n 4/3 g(x a ) + (p^ /3 g (^) 


where 


8(.x) = - 


3 

2 



(/>“)/ 


and x p 


bx 2 

1 + 6 bx sinh -1 x 

_ Ym 

(Pf ) 4/3 


(20.23) 


The derivatives are again straightforward. 

In a similar way there are local and gradient-corrected correlation functionals. One of 
the oldest correlation functionals is that due to Lee, Yang and Parr: 


/ = - 


4 a 

1 +d(P) 

9/ , 


rv rota 

dy a * 


p a pv 

p 


VL 

dy a 


- 2 n/3 ^ ) (3?r 2 ) 2/3 aba) (P) P a P r ‘ ((P“) 8/3 + (P“) 8/3 ) 

+ &r« < 20 - 24 > 


where 


co(P) = 


exp (-cP _1/3 ) 
1 + dP~P 3 


where a, b, c and d are constants. 

A modem reference that contains a wealth of detail for implementing DFT calculations 
is that of Johnson et al. (1993). 
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20.6 Custom and Hybrid Functionals 

Hybrid functionals are those that contain mixtures of the HF exchange with DFT exchange- 
correlation, whilst ab Initio codes usually give the user a choice of constructing their own 
linear combination. 


20.7 An Example 

We worked through a typical HF-LCAO calculation on L-phenylanine using the 6-311G** 
basis set in Chapter 16. The following output relates to a corresponding KS-LCAO single 
point calculation. The first part of the .gjf is: 

# b31yp/6-311G** Pop=full 
L-phenylanine single point 
0 1 

N 0.000 0.000 0.000 

H 0.000 0.000 1.010 

C 1.366 0.000 -0.483 


Following current fashion, I chose the Becke three-parameter exchange and the Lee- 
Yang-Parr correlation functionals (B3LYP). The usual iterative procedure is followed and 
the output is all but identical to that shown in Chapters 16 and 17; a major difference from 
HF-LCAO theory is that the numerical integration has to be done at the end of each cycle, 
although this is not shown explicitly. 

The electronic state of the initial guess is 1-A. 

Warning! Cutoffs for single-point calculations used. 

Requested convergence on RMS density matrix=l. 00D-04 within 128 cycles . 
Requested convergence on MAX density matrix=l. 00D-02 . 

Requested convergence on energy=5.00D-05 . 

No special actions if energy rises. 

SCF Done: E(RB+HF-LYP) =-554.845359196 A.U. after 8 cycles 

Convg = 0.4690D-05 -V/T= 2.0027 

S**2 = 0.0000 

The energy reported is the equivalent of the usual HF-LCAO energy. 

Next come the KS-LCAO orbital energies: 

(A) (A) (A) (A) (A) (A) (A) (A) (A) (A) 

The electronic state is 1-A. 

Alpha occ. eigenvalues-19.19197 -19.12622 -14.29542 -10.31653 -10.22146 

Alpha occ. eigenvalues-10.17833 -10.17422 -10.16835 -10.16719 -10.16615 

Alpha occ. eigenvalues-10.16444 -10.13667 -1.16057 -1.03817 -0.90705 

Alpha occ. eigenvalues-0.85012 -0.78882 -0.74037 -0.73113 -0.68902 

Alpha occ. eigenvalues-0.62748 -0.60433 -0.57744 -0.55764 -0.54022 

Alpha occ. eigenvalues-0.50460 -0.49534 -0.47506 -0.46924 -0.45013 
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Alpha occ. eigenvalues-0.43607 -0.42978 -0.40963 -0.40051 -0.38414 

Alpha occ. eigenvalues-0.37210 -0.35197 -0.34396 -0.33353 -0.30417 

Alpha occ. eigenvalues-0.27739 -0.25593 -0.24403 -0.23759 

and LCAO coefficients (not shown here). The LCAO orbitals tend to be used just like ordin¬ 
ary HF-LCAO orbitals, and many authors make use of the standard Mulliken population 
analysis indices: 

Mulliken atomic charges: 

1 

1 N -0.409568 

2 H 0.190855 

3 C -0.174658 

22 C -0.084815 

23 H 0.085758 

Sum of Mulliken charges= 0.00000 

Comparison with Chapter 17 shows that the Mulliken indices are somewhat different; there 
is nothing particularly sinister in this, we simply have to be careful to compare like with 
like when using such subjective quantities. Finally the properties and timing: 

Electronic spatial extent (au): <R**2>= 2207.1383 
Charge= 0.0000 electrons 

Dipole moment (field-independent basis, Debye): 

X= -0.0080 Y= 1.3326 Z = 0.0814 Tot= 1.3351 
Quadrupole moment ( field-independent basis, Debye-Ang) : 

XX= -73.9662 YY= -64.8347 ZZ= -74.2992 
XY= 1.0222 XZ= -3.5488 YZ= -3.4544 

Traceless Quadrupole moment (field-independent basis, Debye-Ang): 

XX= -2.9328 YY= 6.1987 ZZ= -3.2658 
XY= 1.0222 XZ= -3.5488 YZ= -3.4544 

Octapole moment ( field-independent basis, Debye-Ang**2 ) : 

XXX= 6.5417 YYY= 31.5703 ZZZ= -0.5142 XYY= 1.2511 
XXY= -0.4178 XXZ= -5.5372 XZZ= 12.1891 YZZ= 2.3353 
YYZ= 9.3170 XYZ= -6.7008 

Hexadecapole moment (field-independent basis, Debye-Ang**3 ): 

XXXX= -2011.8342 YYYY= -566.5190 ZZZZ= -187.2276 XXXY= -15.7192 
XXXZ= -56.5913 YYYX= 45.6503 YYYZ= -4.1547 ZZZX= -0.4688 
ZZZY= 0.4758 XXYY= -464.1778 XXZZ= -416.6522 YYZZ= -138.6556 
XXYZ= -47.1416 YYXZ= 18.4000 ZZXY= 3.7164 

N-N= 6.95840887186 8D+02 E-N=-2.687191007826D+03 KE= 5.533590136629D+02 

THERE ' S NOTHING WRONG WITH FOLLOWING A TRAIN OF THOUGHT. . . 

AS LONG AS IT ULTIMATELY PULLS INTO THE STATION. 

Job cpu time: 0 days 0 hours 12 minutes 2 0.0 seconds. 

File lengths (MBytes) : RWF= 52 Int= 0 D2E= 0 Chk= 10 Scr= 1 

The electric moments are similar to the HF values, and the increased cpu time comes 
about because of the numerical integrations. 
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Accurate Thermodynamic Properties; 
the G n Models 


A major objective of quantum chemical modelling is the prediction of molecular structure 
and energy. The enthalpies of formation of many molecules have been determined very 
accurately by precise experimental techniques such as calorimetry and spectroscopy, but 
there is a wide range of species, for example unstable transients, for which the uncertainty 
is large. Molecular ionization energies, electron affinities and proton affinities have also 
been well studied experimentally. Realistic calculations of thermodynamic data have to be 
capable of giving ‘chemical’ accuracy and the target is usually set as ±2kcal mol -1 for 
energy quantities. 

It has been recognized for many years that bond dissociation energies are poorly predicted 
by standard ab Initio HF-LCAO calculations. The development of MPn perturbation theory 
marked a turning point in the treatment of molecular correlation, and made the treatment 
of correlation energy at last possible for molecules of moderate size (but at a high cost 
in computer resource). The MPn methodology is usually implemented up to MP4 level, 
but the convergence of the perturbation series leaves something to be desired and for this 
reason a further quadratic Cl correction is usually applied. 

The effect of polarization functions is also important, and even f orbitals make a 
contribution to the total energies of first-row atoms. 


21.1 G1 Theory 

The Gaussian-1 ( G1) method was introduced by Pople et al. (1989), in order to systemat¬ 
ically investigate the shortcomings in the levels of theory known at that time. Here is the 
abstract of their paper: 
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A general procedure is developed for the computation of the total energies of molecules at 
their equilibrium geometries. Ab initio molecular orbital theory is used to calculate electronic 
energies by a composite method, utilizing large basis sets (including diffuse-sp, double-d 
and f-polarization functions) and treating electron correlation by Mpller-Plesset perturbation 
theory and by quadratic configuration interaction. The theory is also used to compute zero-point 
vibrational energy corrections. Total atomization energies for a set of 31 molecules are found to 
agree with experimental thermochemical data to an accuracy greater than 2 kcal mol -1 in most 
cases. Similar agreement is achieved for ionization energies, electron and proton affinities. 
Residual errors are assessed for the total energies of neutral atoms. 

The original paper treated first-row atoms and molecules, but the extension to second-row 

species followed quickly. I will illustrate the calculation with ethene; here is a suitable .gjf: 


Ethene G1 calculation 


-0.42918455 

0.10397920 

-1.49918455 

0.24608976 

-0.28707399 

1.31608976 


1.60944204 

0.68173712 

1.60944204 

2.78441933 

3.71212425 

2.78441933 


0.00000000 

0.00000000 

0.00000000 

0.00000000 

0.00000000 

0.00000000 


There are eight distinct stages in a G1 calculation; in the Gaussian 03 suite the steps are 
followed automatically once the G1 keyword is specified. 

1. An initial structure is obtained at the HF/6-31G* level of theory. A frequency check 
is also run as a matter of course, and the harmonic frequencies are stored for later use. 

1| l|UNPC-UNK|FOpt|RHF| 6-31G(d) |C2H4|PCUSER|07-Jan-2008|0||# Gl||Ethene 
G1 calculation!|0,1|C,-0.3292236947,-0.5703527276,0 . |H, 0.1799049547, 

-1.5185022354,0.|H,-1.4048899659,-0.6041905569,0.|C, 0.3292236947,0.5703 
527276,0.|H,-0.1799049547,1.5185022354,0.|H, 1.4048899659,0.6041905569,0. || 
Version=x86-Win32-GO3RevB. 01|State=l-AG|HF=-78.0317178|RMSD=4.844e 
-009|RMSF=1.122e-004|Dipole=0. ,0., 0 . |PG=C02H[ SGH(C2H4 ) ]||@ 

TRUST EVERYONE , BUT CUT THE CARDS . 

Job cpu time: 0 days 0 hours 0 minutes 23.0 seconds. 


2. The equilibrium structure is now revised to the MP2/6-31G* level of theory. All 
electrons are included in the correlation treatment; there are no frozen cores. 

l|l|UNPC-UNK|FOpt|RMP2-FU|6-31G(d)|C2H4|PCUSER|07-Jan-2008|0||#N GEOM= 
ALLCHECK GUESS=READ SCRF=CHECK MP2 ( FULL) /6-31G ( D) OPT=RCFC| |Ethene G1 
calculation!|0, l|C,-0.3335853571,-0.5780899752,0.|H,0.1808762111,-1.53 
31889839,0.|H,-1.4179417525,-0.6105921379,0.|C, 0.3335853571,0.57808997 
52,0. |H,-0.1808762111,1.5331889839,0.|H, 1.4179417525,0.6105921379,0. || 
Version=x86-Win32-G03RevB. 01|State=l-AG|HF=-78.0310673|MP2=-78.2942862 
|RMSD=8.954e-009|RMSF=3.758e-005|Dipole=0. ,0. ,0.|PG=C02H [SGH(C2H4 ) ] || 
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3. The geometry from step 2 is now used in a number of single point calculations start¬ 
ing with MP4/6-311G**. This energy is improved in four distinct stages and these four 
improvements are assumed to be additive. 

4. As a first correction we add diffuse s and p basis functions at the MP4 level of theory. 
These are known to be important for molecules with lone pairs. The correction is obtained 
by comparing the MP4/6-311+G** (—78.38425£ h ) and the MP4/6-311G** (—78.38239£ h ) 
energies. The correction is therefore —1.86 mE b . 

5. As a second correction we take account of polarization functions on nonhydrogens. 
This correction is found by comparing MP4/6-311G(2df) and MP4/6-311G** energies. 
This turns out to be —40.07m£ h . 

6. The third correction allows for the inadequacies of the MP4 treatment. We make an 
expensive QCISD(T)/6-311G** calculation. This gives — 1 ,15mE h . 

7. If we add these three corrections to the MP4/6-311G** energy, we should get an 
approximation to a full calculation at the QCISD (T)/6-311G**(2df) level of theory. One 
such large calculation has therefore been replaced by four smaller and cheaper ones. This 
energy is still incomplete because of remaining deficiencies in the basis set. A major diffi¬ 
culty is that the correlation energy between two spin-paired electrons in a typical single bond 
converges only slowly as increasing angular momentum quantum numbers are included in 
the basis set. G1 theory allows for these effects in an empirical manner and so the fourth 
correction is to add terms devised to give agreement between theory and experiment for 
the hydrogen atom and dihydrogen. These are taken to be 6.14m£ h for each valence elec¬ 
tron pair and 0.\9mE h for each unpaired electron, and are referred to as the higher level 
correction (HLC). In terms of the numbers of a- and /3-spin electrons 


Ae (HLC) /mE b = -0.19n a - 5.95n, (21.1) 

which comes out as —36.84 mE h for ethene. 

8. Finally, the harmonic frequencies obtained at the HF/6-31G* level of theory are 
scaled uniformly by the well-accepted factor of 0.8929 to give a zero-point correction 
of +48.90m£ h . 

The total energy G1 (at OK) is therefore —78.41402l£i, and similar calculations for 
H and C give —0.5£ h and —37.78464E h . Combining these three values gives an ethene 
atomization energy of 530.1 kcal mol -1 which compares well with the zero-kelvin JANAF 
thermodynamic value of 531.9 kcal mol -1 . 

Final output contains extra useful thermodynamic data and is shown below. A simple 
restart gives rapid results for different temperatures and pressures. 


Temperature= 
E(ZPE)= 

E(QCISD(T) ) = 
DE(Plus)= 

G1 ( 0 K) = 

G1 Enthalpy= 


298.150000 

0.048894 

-78.384142 

-0.001864 

-78.414021 

-78.410020 


Pressure= 

E(Thermal)= 
E(Empiric)= 
DE(2DF)= 

G1 Energy= 


1.000000 

0.051951 

-0.036840 

-0.040069 

-78.410964 


G1 Free Energy= -78.435526 


21.2 G2 Theory 

G1 theory was originally tested against experimental values for a range of simple first- 
and second-row molecules. It was observed that G1 theory did badly with ionic molecules, 
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with triplet state molecules and with hypervalent molecules. Gaussian 2 (G2) theory was 
introduced by Curtiss et al. (1991), and it eliminates some of the difficulties by making 
three improvements to Gl. 

1. G2 theory eliminates the assumption of additivity of the diffuse sp and the 2df basis 
functions used in Gl theory. This change gives a significant improvement for ionic 
species and some anions. 

2. It adds a third d function to the nonhydrogen atoms and a second p function to the hydro¬ 
gen atoms. The third d function is especially important for some hypervalent molecules 
such as S0 2 , whilst the second p function significantly improves the atomization energy 
of some hydrogen-containing molecules. 

3. The higher level correction is determined by a least squares fit for 55 molecules rather 
than just the hydrogen atom and dihydrogen. This also contributes to an improvement 
in calculated energies. 

In fact, G2 only requires one extra calculation above Gl, namely MP2/6-311+G(3df,2p). 

A comparison was made for 79 well-established molecules, including 43 that were 
not included in the original Gl papers. The final total energies are essentially at the 
QCISD(T)/6-311+G(3df,2p) level of theory. 


21.2.1 G2(MP2) 

It was subsequently found that significant savings in computer resource could be obtained 
at little cost in accuracy by reducing the order of the MP4 calculation to MP2 (giving the 
G2 (MP2) variant). 


21.3 G3 Theory 

A recent reassessment by Curtiss et al. (1998) of G2 theory used 302 energies which 
included 148 enthalpies of formation, 88 ionization energies, 58 electron affinities and 8 
proton affinities for larger and more diverse molecules. This revealed some interesting dis¬ 
crepancies. For example, the enthalpy of formation of CF 4 is too positive by 7.1 kcal mol -1 
whilst that of SiF 4 is too negative by 5.5 kcal mol" 1 . The deviations were also much lar¬ 
ger for unsaturated systems than for saturated ones. These considerations led Curtiss et al. 
(1999) to propose Gaussian-3 (G3) theory, which follows along the same lines as the earlier 
Gl and G2 theories in that it is a well-defined sequence of ab Initio steps to arrive at the 
total energy of a given molecule. As will be seen from the discussion below, G3 differs 
from G2 theory in several major ways. 

1. An initial HF/6-31G(d) equilibrium structure is obtained using the RFIF or UHF 
treatment (as in G2 theory). 

2. HF/6-31G(d) structure is used to find vibration frequencies that are then scaled by 
0.8929. The zero-point energy is calculated (as in G2 theory). 

3. Equilibrium geometry is refined at MP2/6-31G(d) level, including all electrons. This 
geometry is then used in single point calculations (as in G2 theory). 
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4. The first higher level correction is a complete MP4/6-31G(d) single point, which is then 
modified to allow for diffuse functions, higher polarization functions, higher correlation 
terms using QCI, larger basis sets and the effect of nonadditivity. 

5. The MP4/6-31 G(d) energy and the four corrections are combined along with a spin-orbit 
correction. 

6. The usual higher level correction is added to take account of remaining deficiencies: 

As (HLC) = —Aria — B(n a — ria) for molecules 

( 21.2) 

As (HLC) = —Crip —D(n a — rip) for atoms 

A through D are constants chosen to give the smallest average deviation from experiment 
for the G2 test set. 

7. The total energy at 0 K is obtained by adding the zero-point energy from step 2. 

The overall agreement with experiment for 299 test energies is 1.02 kcal mol -1 compared 
to 1.48 kcal mol -1 for G2 theory. Use of MP3 rather than MP4 gives a saving in computer 
resource but the agreement with experiment becomes 1.22 kcal mol -1 . 

21.3.1 G3 Variations 

In view of the discussion for G2, you will not be surprised to learn that there is a G3(MP2) 
version which aims to give a saving in computer resource by reducing the MP4 basis set 
extension calculations to MP2. 

Given the steady rise in density functional theory, you might like to read the work of 
Baboul et al. (1999), who use B3LYP geometries and frequencies. The authors report that 
the use of density functional geometries and zero-point energies in G3 and G3(MP2) is a 
useful alternative to MP2 geometries and HF zero-point energies. 

There have been a number of variations on similar themes over the years, and these are 
nicely summarized in Baboul et al. (1999). 
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Transition States 


Geometry optimization plays a key role in modern molecular modelling. Ah Initio packages 
invariably contain powerful options for geometry optimization that can locate stationary 
points on a molecular potential energy surface starting from naive back-of-an-envelope 
molecular sketches. Whether that stationary point is the global minimum is of course open 
to question; you simply have to try a lot of possible starting geometries and use chemical 
intuition. 

Most current methods for geometry optimization are based on a Taylor series expansion 
of the energy about a reference point X 0 ; the energy at point X is given in terms of the 
gradient g and the Hessian H by 

£ = e 0 + (X - X 0 ) T g + 1 (X - X 0 ) T H(X - X 0 ) + • ■ ■ (22.1) 

Newton-Raphson type methods truncate the energy expression at the quadratic term, and 
tell us that the best step to take in order to reach a stationary point is 

X - X 0 = —H ‘g (22.2) 

A great deal of effort has gone into determining explicit expressions for the gradient 
and the Hessian at many levels of theory (such as HF and MP2). Calculations start with 
initial estimates of the gradient and the Hessian, and an iterative calculation follows. As 
noted earlier, many methods choose to increment the inverse H rather than H as the 
optimization progresses. 

The efficiency of a geometry optimization therefore depends on a number of factors: the 
initial geometry, the choice of coordinate system, the initial Hessian, the Hessian updating 
method and finally the search direction and step size. 

Minima correspond to points on the molecular potential energy surface where the eigen¬ 
values of the Hessian matrix are all positive. Transition states are characterized as stationary 
points having just one negative eigenvalues of H. The requirement for a single negative 
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eigenvalue of H means that one has to be much more careful with the step taken in an optim¬ 
ization. For example, it is always possible to take a steepest descent step in a minimum 
search, which will lower the energy or leave it unchanged. Such a step is not appropriate 
for transition state searching; it is therefore harder to find a transition state than a minimum 
and many Newton-Raphson type procedures need to start the optimization fairly close to 
the true transition state structure. 

The key concept in our discussion is therefore the Hessian matrix H. This is a real 
symmetric matrix and the eigenvalues and eigenvectors can be found by standard numer¬ 
ical methods. Simons et al. (1983) showed that each Newton-Raphson step is directed 
in the negative direction of the gradient for each eigenvector that has a positive Hessian 
eigenvalue, and along the positive direction of the gradient for each eigenvector that has a 
negative eigenvalue. 

For a transition state search, if you are in a region of the molecular potential energy surface 
where the Hessian does indeed have one negative eigenvalue, the Newton-Raphson step 
is appropriate. If you have landed on some region of the surface where the Hessian does 
not have this desired property, then you must somehow get out of the region and back to a 
region where the Hessian has the correct structure of one negative eigenvalue. 

For minima, qualitative theories of chemical structure are a valuable aid in choosing 
starting geometries whilst for transition states one only has a vague notion that the saddle 
point geometry must lie somewhere between the reactants and products. Molecular mech¬ 
anics (MM) structures are often good as starting structures for geometry optimizations but 
MM cannot handle bond making and bond breaking. 

One of the earliest algorithms that could take corrective action when the wrong region of 
the molecular potential energy surface was chosen was that due to Poppinger (1975). This 
author suggested that the lowest eigenvalue of the Hessian should be followed ‘uphill’. The 
technique has come to be known as eigenvector following. There is a good discussion of 
eigenvector following in Baker’s (1986) paper, which has the following abstract. 

An algorithm for locating transition states designed for use in the ab initio program package 
GAUSSIAN-82 is presented. It is capable of locating transition states even if started in the 
wrong region of the energy surface, and, by incorporating the ideas on Hessian mode following 
due to Cerjan and Miller, can locate transition states for alternative rearrangements/dissociation 
reactions from the same initial starting point. It can also be used to locate minima. 

Baker refers to eigenvector following as ‘Hessian mode following’. The big problem with 
eigenvector following is that the remaining directions along the potential energy surface 
are left in isolation until the initial fault is corrected. 

If we think of a chemical reaction (either reactants or products) as minima on the molecu¬ 
lar potential energy surface, then there is no unique way of moving uphill on the surface 
since all directions go uphill. The linear synchronous transit (LST) algorithm searches 
for a maximum along a path between reactants and products. It frequently yields a struc¬ 
ture with two or more negative Hessian eigenvalues, and this is not a transition state. The 
quadratic synchronous transit (QST) method searches along a parabola for a minimum in 
all directions perpendicular to the parabola. The following is the abstract of Halgren and 
Lipscomb’s (1977) original reference for synchronous transit. 

In the synchronous-transit method, a model linear synchronous transit pathway is first con¬ 
structed and is then refined by optimizing one or more intermediate structures subject to the 
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constraint that the optimized structure retain the same relative position along the path (ortho¬ 
gonal optimization). The method yields a series of energy estimates which progressively bound 
the energy of the transition state from above and from below. High computational efficiency is 
attainable, and sufficient flexibility is provided to deal with asynchronous processes. Compar¬ 
isons are made to the alternative ‘reaction coordinate’ approach, which is shown to be subject 
to several serious deficiencies. The method is applied to a model two-dimensional energy sur¬ 
face and to the allowed electrocyclic interconversions of the cyclopropyl and allyl cations and 
of cyclobutadiene and civ-butadiene. 

As we saw in earlier chapters, early work in geometry optimization using electronic 
structure methods used nonredundant internal coordinates (e.g. the Z-matrix). Pulay et al. 
(1979) demonstrated clearly that redundant internal coordinates are the best choice for 
minimizing many molecules. Peng et al. (1996) reported similar conclusions for transition 
state structures, especially when the initial steps of such calculations are guided by QST. 


22.1 An Example 

Gaussian 03 includes the synchronous transit-guided quasi-Newton method; that is, it uses 
QST to get close to the quadratic region of the transition state and then quasi-Newton- 
Raphson to complete the optimization. There are two flavours: Opt = QST2 requires 
reactants and products as input structures, whilst Opt = QST3 requires in addition the 
initial transition state. 

The isomerization reaction 

HCN -> HNC 

has been used frequently to test transition state optimizations and reaction path algorithms. 
Here is the QST3 input: 


%chk=c:/hen.chk 


#mp2/6-311g(2d,lp) opt=qst3 freq 


HCN (Reactant) 

0 1 

C 0.0 0.0 -0.506777 

H 0.0 0.0 -1.574099 

N 0.0 0.0 0.659252 


HNC (Product) 

0 1 

C 0.0 0.0 -0.743084 

H 0.0 0.0 1.432195 

N 0.0 0.0 0.432330 


HCN --> HNC Transition State H above CN 
0 1 

C 0.0 0.0 -0.506777 

H 0.0 1.07 -0.506777 

N 0.0 0.0 0.659252 


It is wise to allow for electron correlation when dealing with reactions that involve 
bond breaking and bond making. I first optimized HCN and HNC and then made 
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a guess at the transition state (hydrogen above carbon). Here is a fragment of the 
output: 


QST in optimization variable space. 



Tangent TS vect // Eig F Eigenval 

1 R1 

0.47182 

0.60987 

0.00000 

-0.07462 

2 R2 

-0.13266 

-0.01687 

0.00000 

0.16000 

3 R3 

-0.61297 

-0.48257 

-0.00003 

0.36068 

4 A1 

0.43494 

0.44104 

0.00000 

1000.00000 

5 A2 

0.00000 

0.00000 

0.00000 

1000.00000 

6 A3 

-0.44146 

-0.44765 

0.00000 

1000.00000 

7 A4 

0.00000 

0.00000 

0.00000 

1000.00000 


RFO step: Lambda0=4.157509059D-11 Lambda=0.00000000D+00 . 
Linear search not attempted -- option 19 set. 

Iteration 1 RMS(Cart)= 0.00001745 RMS(Int)= 0.00000000 
Iteration 2 RMS(Cart)= 0.00000000 RMS(Int)= 0.00000000 


Variable 


R1 

R2 

R3 

A1 

A2 

A3 

A4 


Old X -DE/DX 

(Linear) 
2.47517 -0.00002 

2.21560 -0.00001 

2.61751 -0.00002 

5.10378 0.00000 

3.14159 0.00000 

5.21958 0.00000 

3.14159 0.00000 


Delta X 
(Quad) 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 


Delta X 
(Total) 
-0.00003 
-0.00004 
-0.00006 
0.00001 
0.00000 
- 0.00001 
0.00000 


Delta X 

-0.00003 

-0.00004 

-0.00006 

0.00001 

0.00000 

- 0.00001 

0.00000 


New X 

2.47514 

2.21556 

2.61745 

5.10379 

3.14159 

5.21957 

3.14159 


Maximum Force 
RMS Force 
Maximum Displacement 0.000030 
RMS Displacement 0.000017 


Value Threshold 
0.000017 0.000450 

0.000010 0.000300 

0.001800 

0.001200 


Predicted change in Energy=-9.906105D—10 


Converged? 

YES 

YES 

YES 

YES 


Optimization completed. 

-- Stationary point found. 


! Optimized Parameters ! 
! (Angstroms and Degrees) ! 


! Name 

Definition 

TS 

Reactant 

Product 

Derivative Info. 

! R1 

R(l,2) 

1.3098 

1.0673 

2.1753 

-DE/DX = 0.0 

! R2 

R(l,3) 

1.1724 

1.166 

1.1754 

-DE/DX = 0.0 

! R3 

R(2,3) 

1.3851 

2.2334 

0.9999 

-DE/DX =0.0 

! A1 

L(2,1,3,-1,-1) 

292.425 

180.0 

180.0 

-DE/DX =0.0 

! A2 

L(2,1,3,-2,-2) 

180.0 

180.0 

180.0 

-DE/DX =0.0 

! A3 

L(l,3,2,-1,-1) 

299.0599 

180.0 

180.0 

-DE/DX =0.0 

! A4 

L(1,3,2,-2,-2) 

180.0 

180.0 

180.0 

-DE/DX =0.0 
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from which we note that a (triangular) stationary point has indeed been found. But is it a 
transition state? I included the freq control word to check, and sure enough 


****** i imaginary frequencies (negative Signs) ****** 

Harmonic frequencies (cm**-l), IR intensities (KM/Mole), Raman scatter- 

activities (A**4/AMU), depolarization ratios for plane and unpolarized 
incident light, reduced masses (AMU), force constants (mDyne/A), 
and normal coordinates: 


Frequencies -- 
Red. masses - - 
Frc consts -- 
IR Inten - - 


A' 

-1004.9940 

1.1934 

0.7102 

79.7952 


A' 

2058.3451 

1.0539 

2.6307 

218.6471 


3 

A' 

2151.5200 

11.8266 

32.2554 

5.1097 


Atom AN X Y Z 

1 6 0.08 0.06 0.00 

2 1 0.16 -0.98 0.00 

3 7 -0.08 0.02 0.00 


X Y Z 

-0.06 0.00 0.00 
0.99 0.13 0.00 

- 0.02 - 0.01 0.00 


X Y Z 

-0.01 0.72 0.00 

-0.18 0.24 0.00 

0.02 -0.63 0.00 


22.2 The Reaction Path 

A transition structure is a maximum on the reaction pathway, but for some reactions the 
potential energy surface can be complicated. It may not be obvious whether the transition 
state connects the desired reactants and products and in such cases it is advisable to follow 
the reaction path from the transition state to the reactants and to the products. A reaction 
path can be defined for any coordinate system as the minimum energy path from the 
transition state down to the reactants and to the products; if the coordinates are mass- 
weighted Cartesians this path is the intrinsic reaction coordinate (IRC). The key reference 
for IRC calculation is the paper by Gonzalez and Schlegel (1989), and this algorithm is 
implemented in Gaussian 03. The following is part of the abstract. 

A new algorithm is presented for obtaining points on a steepest descent path from the transition 
state of the reactants and products. In mass-weighted coordinates, this path corresponds to the 
intrinsic reaction coordinate. Points on the reaction path are found by constrained optimizations 
involving all internal degrees of freedom of the molecule. The points are optimized so that the 
segment of the reaction path between any two adjacent points is given by an arc of a circle, and 
so that the gradient at each point is tangent to the path. Only the transition vector and the energy 
gradients are needed to construct this path. The resulting path is continuous, differentiable and 
piecewise quadratic... 

We now take the HCN —> HNC example. 

—Linkl— 

%chk=c:/hen.chk 

# mp2/6-311g(2d, lp) irc=rcfc geom=check 
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HCN TS 
0 1 

This --Links-- continues our HCN isomerization reaction discussed above. By default, 
Gaussian 03 IRC calculations step six points in mass-weighted increments in the ‘forward’ 
direction and six in the ‘reverse’. Force constants have to be provided (or recalculated), 
hence the ref c option. 


Optimization completed. 

— Optimized point # 12 Found. 


! Optimized Parameters ! 
! (Angstroms and Degrees ) ! 


! Name Value Derivative information (Atomic Units) ! 


XI 

Y1 

Z1 

X2 

Y2 

Z2 

X3 

Y3 

Z3 


0.0 —DE/DX =0.0 

-0.2539 -DE/DX = -0.0158 

-0.603 -DE/DX = -0.007 

0.0 -DE/DX =0.0 

0.9864 -DE/DX = 0.0035 

-0.1426 -DE/DX = 0.0149 

0.0 -DE/DX =0.0 

0.0715 -DE/DX = 0.0124 

0.5549 -DE/DX = -0.0079 


RADIUS OF CURVATURE = 1.09348 

NET REACTION COORDINATE UP TO THIS POINT = 0.58614 

# OF POINTS ALONG THE PATH =12 

# OF STEPS = 2 


Summary of reaction path following: 
(Int. Coord: Angstroms, and Degrees ) 


9 

10 
11 


ENERGY 

-93.14253 

-93.14067 

-93.13882 

-93.13706 

-93.13522 

-93.13232 

-93.13097 

-93.13388 

-93.13711 

-93.13936 

-93.14130 


RX.COORD 
-0.56420 
-0.46423 
-0.36429 
-0.26487 
-0.16762 
-0.06849 
0.00000 
0.08959 
0.18889 
0.28717 
0.38640 


XI 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 


Y1 

-0.24523 

-0.24115 

-0.23715 

-0.23350 

-0.23137 

-0.23122 

-0.23507 

-0.23876 

-0.23961 

-0.24193 

-0.24558 


Z1 

-0.60183 

-0.59957 

-0.59692 

-0.59337 

-0.58878 

-0.58392 

-0.58223 

-0.58527 

-0.59000 

-0.59446 

-0.59813 
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12 -93.14313 0.48621 0.00000 -0.24970 -0.60087 

13 -93.14485 0.58614 0.00000 -0.25393 -0.60301 

The first part of this fragment shows the final geometry optimization, followed by all 13 
points along the IRC path. Point 1 is seen to correspond to HCN, point 7 to the transition 
state and point 13 to HNC which demonstrates that the transition state really does join HCN 
and HNC. 
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Dealing with the Solvent 


Chemical reactions generally take place in condensed media. I have had very little to say so 
far about the presence or absence of a solvent, and many of the applications discussed have 
referred to single, isolated molecules at 0 K and in free space. The benchmark molecular 
dynamics (MD) and Monte Carlo (MC) applications discussed in earlier chapters dealt with 
arrays of particles with particularly simple potentials, and the quantum mechanical models 
have made no mention of a solvent. 


23.1 Solvent Models 

The obvious way to take account of the solvent in a MM calculation is to physically add 
solvent molecules and then optimize the molecular geometry, but such calculations tend 
to be particularly compute-intensive and the necessary computer power simply was not 
available to the pioneers in the field. The same comment obviously holds for quantum 
mechanical studies. 

In a medium of relative permittivity e r , the mutual potential energy of two point charges 
(2a and <2b is reduced by a factor of e r . Typical values are shown in Table 23.1. 

Relative permittivities are temperature dependent, and they are usually recorded as a 
power series in the temperature: 



Early workers in the MM field attempted to allow for the presence of a solvent by 
modifying any electrostatic contribution to their force field. The problem was that the 
chosen factor was at first sight quite arbitrary, and bore little relation to the values in 
Table 23.1. For example, a value of 2.5 was often used for liquid water. All kinds of 
convincing arguments were used to justify such choices, for example the two atoms A and 
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Table 23.1 Representative relative permittivities 


Substance 

s r 

Free space 

1 

Air 

1.0006 

Class 

6 

Water 

81 


B in the molecule of Figure 23.1 will not have much exposure to a solvent and so a pure 
solvent value of e r is not appropriate. 



Figure 23. 1 Atoms A and B in a molecule surrounded by solvent 


23.2 Langevin Dynamics 


Molecules in solution undergo collisions with other molecules and the solvent, and they 
can also be thought to experience frictional forces as they move through the solvent. The 
Langevin technique allows for both these effects. The collisions are modelled by the addition 
of a random force R, and the frictional effects are modelled by adding a frictional drag that is 
directly proportional to the velocity of a given particle. Solvent molecules are not explicitly 
included. 

We saw in Chapter 10 that the key equation for a MD simulation is Newton’s second law: 


(23.1) 


In Langevin dynamics, the force is modified according to the ideas discussed above: 


(23.2) 


The frictional term introduces energy and the random force removes it as kinetic energy. 
The quantity y is the collision frequency, and lly is often called the velocity relaxation 
time ; it can be thought of as the time taken for a particle to forget its initial velocity. There 
is an equation due to Einstein that relates y to the diffusion constant D of the solvent 


k B T 

mD 
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Integration of the new equation of motion proceeds along the lines discussed in 
Chapter 10. A straightforward algorithm has been given by Ermak and Buckholtz (1980). 
The equations of motion are integrated over a time interval At that is sufficiently short so 
that the interparticle forces remain approximately constant. The algorithm for advancing 
the position r A and velocity v A of particle A is then similar to those discussed in Chapter 10; 
we have 


r A (f + At) = r A (t) + cj At + c 2 ( Af ) 2 + Af a 

v A (t + At) = c 0 + c i Af + Av a (23.3) 

Here r A (t), (dr A /df), and (d 2 r A /dt 2 ), are the instantaneous position, velocity and acceleration 
vector of particle A. The acceleration is calculated from the force. Ar G and Av G are random 
vectors chosen from a Gaussian distribution with zero mean and standard deviations 

= 4 '* © VK, ( 2 ~ 7K, (3 -Aexp(-KA0 + exp(-2,A,))) 

— ) (1 - exp(-yAt)) (23.4) 

m K ) 

The numerical coefficients are given by 

c 0 = exp (-y At) 

1 ~ c 0 
/At 

1 ~ G 

yAt 

At low values of the friction coefficient, the dynamical aspects dominate. If the inter¬ 
particle forces are taken to vary linearly with time between each time step, the equations 
of motion can be rewritten in a form that is said to produce a more accurate simulation: 


Cl = 

c 2 = 



r A (f + At) = r A (f) + Ci At + °2 ( A 0 2 + Af a 

v A (, + A,) = c„ + (Ci - c) a, + c, ^ A, + AvJ 


(23.5) 


and as y —> 0 we recover the velocity Verlet algorithm discussed in Chapter 10. For large 
values of y , the random collisions dominate and the motion becomes diffusion-like. 

The same considerations apply to Langevin dynamics as to standard MD; there are 
three stages to a simulation, the heating phase, the data collection stage and the optional 
cooling stage. For the sake of illustration, Figure 23.2 shows the end-to-end distance in the 
hydrocarbon C20H42, over a 10 ps experiment with a 4 ps -1 friction coefficient. 
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Figure 23.2 Langevin dynamics on C 20 hydrocarbon 


The end-to-end distance varied much less than a comparable gas-phase experiment, 
showing the damping effect of the random collisions. 


23.3 Continuum Solvation Models 


In our discussion of electric charges and their properties (Chapter 1), we met the concepts 
of polarization and polarizability. The dielectric polarization P physically is the induced 
dipole moment per volume, and I mentioned that this experimentally determined quantity 
can depend on the applied field E in all manner of complicated ways. For very simple media 
and for low field strengths, it turns out that P is directly proportional to E and so 

P = fe - 1) e 0 E 

Xe = £r - 1 

where I have introduced the electric susceptibility Xe- The relative permittivity and the 
refractive index n are also related by 


Suppose now that the material is a gas comprising N atoms each of polarizability a in 
a volume V. When an external field E is applied, each atom acquires an induced dipole 
moment aE and so the polarization is 


(23.6) 
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Comparison of the macroscopic and microscopic equations gives 



which can be expanded by the binomial theorem to give 


and if the gas is a perfect gas we have finally 
n= 1 + - 


(23.7) 


which gives an experimental route to the polarizability a. 

In condensed phases, the separation between molecules is of the order of the molecu¬ 
lar dimensions and the result is that each molecule is polarized not by just the ordinary 
applied electric field E but by a local field F comprising E plus the field of all the other 
dipoles. Once the local field is known, we can use Equation (23.6) to find P simply 
by substituting aF for aE. The calculation of F is in fact difficult because the dipoles 
that contribute to F are themselves determined by F and a self-consistent treatment is 
needed. 

A little manipulation yields the well-known Clausius-Mossotti equation 


e r — 1 _ Not 
e r + 2 ~~ 3^V 


(23.8) 


which can be used to calculate a from an experimental value of e r . 

So far we have assumed that the molecules in the system have no permanent electric 
dipole moments. When this restriction is relaxed we have to allow for the orientation 
polarization-, the interaction energy of a dipole moment vector with an electric field is 


U = — Po-E 

so that in a fluid the molecules will tend to orient themselves parallel to the field. This 
tendency will be opposed by random thermal agitation, and a calculation by the methods of 
statistical mechanics yields Debye’s (1912) correction to the Clausius-Mossotti equation 



Debye’s dipole theory accounts quantitatively for the dielectric properties of gases and 
qualitatively for those of condensed phases. A more thorough analysis of the problem 
shows that one should not expect the dipole moment to remain constant because real 
molecules have a nonzero polarizability. The polarization of the dielectric in the electric 
field of the molecule itself gives rise to a reaction field. The classic literature reference is 
Onsager (1936). 

Continuum models of the solvent have their origin in these physical considerations; one 
component of the system (the solute M) is treated microscopically and the remainder of 
the system (the solvent) is treated macroscopically. 
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Figure 23.3 The Onsager model 


In any continuum model, M resides in a cavity carved out of an infinite slab of continu¬ 
ous dielectric (the solvent), as shown in Figure 23.3. The cavity is a basic concept in all 
continuum models; the size and shape of the cavity are differently defined in the various 
versions of continuum models but, as a general rule, it should exclude the solvent and con¬ 
tain within its boundaries the largest possible part of the solute charge distribution. Much 
attention has been paid to the portion of the solute electronic charge outside the boundary 
of the cavity and the term escaped charge is often used in this context. 

It is universally accepted that the cavity shape should ideally reproduce the molecular 
shape but as usual in modelling studies there is a trade-off between computer resource 
and the desire for better accuracy. Calculations are usually easiest when the cavity has a 
simple shape. 

23.3.1 The Onsager Model 

In the simplest such model, named after Onsager, the cavity is a sphere of fixed radius; the 
relative permittivity is 1 inside the cavity and equal to that of the solvent outside. M must 
have a nonzero dipole moment, which induces a polarization P in the dielectric. P then 
reacts with the solute dipole, leading to stabilization. Workers in the field speak about the 
self-consistent reaction field (SCRF). 

A suitable Onsager sphere radius for (for example) phenylanine can be found from a 
Gaussian 03 run such as: 

%chk=d:\phenopt.chk 

# B3LYP/6-311G** Opt geom=check volume 
L-phenylanine geometry optimization then volume 
0 1 

which produced the following MC calculation of the molecular volume, and estimate of 
the cavity sphere radius. 

Using the total density. 

Monte-Carlo method of calculating molar volume: 
based on 0.001 e/bohr**3 density envelope. 

Number of points per bohr**3 = 20 CutOff= 1.00D-04 
Using the SCF density. 

There are 379 points. Will hold 379 in memory. 
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LenV= 6095246 MDV= 6291456. 

Box volume = 6850.641 fraction occupied=0.214 
Integrated density= 8.0982806152148228D+01 
error=-7.0171938478517717D+00 
Molar volume = 1464.121 bohr**3/mol (130.657 cm**3/mol) 

Recommended aO for SCRF calculation = 4.60 angstrom (8.70 bohr) 

Finally, an Onsager SCRF calculation using dichloromethane (arbitrarily) as solvent: 
%chk=d:\phenopt.chk 

# B3LYP/6-311G** scrf=(dipole,a0=4.60, 
solvent=dichloromethane) 

# geom=check 

L-phenylanine Onsager model 
0 1 

gives the following: 

SCRF input: a0= 4.60 angstrom, dielectric constant= 8.93 

Requested convergence on RMS density matrix=l. 00D-04 within 128 cycles . 

Requested convergence on MAX density matrix=l. 00D-02 . 

Requested convergence on energy=5.00D-05 . 

No special actions if energy rises. 

SCF Done: E(RBtHF-LYP) =-554.947318515 A.U. after 7 cycles 
Convg = 0.4858D-05 -V/T = 2.0038 

S**2 = 0.0000 
Final SCRF E-Field is: 

Dipole : -0.00005916 -0.00025177 -0.00075469 
Quadrupole : 0.00000000 0.00000000 0.00000000 
0.00000000 0.00000000 0.00000000 
Octapole : 0.00000000 0.00000000 0.00000000 0.00000000 
0.00000000 0.00000000 0.00000000 0.00000000 
0.00000000 0.00000000 

Hexadecapole: 0.00000000 0.00000000 0.00000000 0.00000000 
0.00000000 0.00000000 0.00000000 0.00000000 
0.00000000 0.00000000 0.00000000 0.00000000 
0.00000000 0.00000000 0.00000000 
Polarization energy = -0.248405263057E-03 
Total energy of solute = -554.947566921 
Total energy (include solvent energy) = -554.947318515 
Total energy (without reaction field) = -554.947070110 
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23.3.2 Polarizable Continuum Models 

It is possible in principle to define and determine the cavity from a straightforward ab 
initio/DVT calculation by defining a molecular isodensity level outside which electron 
density is treated as negligible. Such calculations are possible within Gaussian 03 but 
literature applications are rare (presumably because of the expense). 

The generally accepted compromise between computer resource and accuracy is based 
on the definition of the cavity as a superposition of interlocked atomic spheres with radii 
near the van der Waals radius. Our key reference is the review by Tomasi et al. (2005), 
which builds on the oft-quoted review by Tomasi and Persico (1994). 

For example, we might want to investigate the geometry and molecular properties of 
phenylanine in different solvents using the PCM model. Here is a suitable Gaussian 03 .gjf: 

%chk=d:\phenopt.chk 

# B3LYP/6-311G** serf = (pcm,solvent=dichloromethane) opt freq 

# geom=check 
L-phenylanine PCM model 

0 1 

I have not shown the output for lack of space, but if you care to run the job then everything 
will be instantly recognizable, including the optimized geometry and final energy. 


23.4 Periodic Solvent Box 

In Langevin dynamics and continuum models, the solvent is simulated; no solvent 
molecules are explicitly included in the calculation. Such calculations give little micro¬ 
scopic detail. 

I mentioned the interest in water as solvent in earlier chapters, and it is worth reading 
the abstract to Jorgensen’s (1983) landmark Monte Carlo paper: 

Classical Monte Carlo simulations have been carried out for liquid water in the NPT ensemble 
at 25°C and 1 atm using six of the simpler intermolecular potential functions for the water 
dimer: Bemal-Fowler (BF), SPC, ST2, TIPS2, TIP3P and TIP4P. Comparisons are made 
with experimental thermodynamic and structural data including the recent neutron diffraction 
data of Thiessen and Narten. The computed densities and potential energies are in reasonable 
accord with experiment except for the original BF model, which yields an 18% overestimate 
of the density and poor structural results. The TIPS2 and TIP4P potentials yield oxygen- 
oxygen partial structure functions in good agreement with the neutron diffraction results. The 
accord with the experimental OH and HH partialstructure functions is poorer; however, the 
computed results for these functions are similar for all the potential functions. Consequently, 
the discrepancy may be due to the correction terms needed in processing the neutron data 
or to an effect uniformly neglected in the computations. Comparisons are also made for self¬ 
diffusion coefficients obtained from molecular dynamics simulations. Overall, the SPC, SDT2, 
TIPS2 and TIP4P models give reasonable structural and thermodynamic descriptions of liquid 
water and they should be useful in simulations of aqueous solutions. The simplicity of the 
SPC, TIPS2, and TIP4P is also attractive from a computational standpoint. 
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Modem molecular modelling packages such as HyperChem usually include the option 
of a periodic solvent box, often the TIP3P box of 216 water molecules. The example shown 
in Figure 23.4 is from a MM optimization using phenylanine in water. 



Figure 23.4 Phenylanine in periodic solvent box 


The standard box is a cube of side 1870 pm, and it is conventional to choose the appro¬ 
priate box with sides in multiples of this. I should not need to remind you by now of the 
significance of the word ‘periodic’. 
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Hybrid Models; the QM/MM 
Approach 


It all started with a paper (Warshel and Levitt 1976) in the Journal of Molecular Biology, 
which by and large is not read by chemists. Once again, I will let the two authors give you 
their ideas in their own words. 

A general method for detailed study of enzymic reactions is presented. The method considers 
the complete enzyme-substrate complex together with the surrounding solvent and evaluates 
all the different quantum mechanical and classical energy factors that can affect the reaction 
pathway. These factors include the quantum mechanical energies associated with bond cleavage 
and charge redistribution of the substrate and the classical energies of steric and electrostatic 
interactions between the substrate and the enzyme. The electrostatic polarization of the enzyme 
atoms and the orientation of the dipoles of the surrounding water molecules is simulated 
by a microscopic dielectric model. The solvation energy resulting from this polarization is 
considerable and must be included in any realistic calculation of chemical reactions involving 
anything more than an isolated atom in vacuo. Without it, acidic groups can never become 
ionized and the charge distribution on the substrate will not be reasonable. The same dielectric 
model can also be used to study the reaction of the substrate in solution. In this way, the reaction 
is solution can be compared with the enzymic reaction... 

What the authors did was to combine a molecular mechanics (MM) potential for the 
solvent with an early (MINDO/2) quantum mechanical model for the solute. By 1998 such 
hybrid methods had become sufficiently important to justify an American Chemical Society 
symposium (Gao and Thompson 1998). 

The early hybrid models such as that of Warshel and Levitt are often referred to as 
‘QM/MM’. Note that there is no mention of geometry optimization, and that attention 
focuses on the solute/solvent problem. 
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24.1 Link Atoms 

In the case of solution-phase studies, it is clear where we should draw a boundary between 
the parts of a system that ought to be studied quantum mechanically, and those that can be 
treated by the techniques of MM. 

Biochemists have different problems in mind; they want to divide parts of a protein chain 
into regions of interest that ought to be treated quantum mechanically, and the remainder 
of the chain that can be treated according to the methods of MM. 

Consider for example the protein shown in Figure 24.1. The bottom left-hand amino acid 
is valine, which is linked to proline. Suppose for the sake of argument that we wanted to 
treat this valine quantum mechanically, and the rest of the protein chain according to the 
methods of MM. We would have to draw a QM/MM boundary somewhere between valine 
and the rest of the protein. The link atoms define the boundary between the QM and the MM 
regions. A great deal of care has to go into this choice of boundary. The boundary should 
not give two species whose chemical properties are quite different from those implied by 
the structural formulae on either side of this boundary. 



Figure 24.1 A typical protein (IPCN, from the Protein Databank) 


In order to achieve this aim, it is usual to ‘cap’ the exposed valency at the boundary by 
adding an extra quantum mechanical atom. There are several reasons for doing this, not the 
least of which is that we do not want to be left with a radical for the quantum mechanical 
part. The extra quantum mechanical atom does not see any atoms within the MM region 
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and the final atom of the MM region is attached to this quantum mechanical atom by a 
harmonic MM potential. 


24.2 IMOMM 

When faced with the theoretical study of a system involving very large molecules, there are 
two possible approaches. One is to model the real system as a simpler one and proceed with 
(for example) an ab initio treatment of the simpler system. Thus for example, transition 
metal complexes often contain bulky ligands with large organic susbtituents; those far away 
from the metal atom are usually replaced by hydrogen atoms. This approach works well 
because the metal-ligand interaction remains well described. The only problem arises when 
the bulky ligands give rise to steric interactions. 

Molecular mechanics methods have found favour in biochemical applications where 
very large molecules are involved, or areas such as MD where a large number of different 
calculations are required. 

One possible way to overcome the shortcomings of both approaches is to mix quantum 
mechanics and MM in different parts of the same molecule. The integrated ab initio + 
molecular mechanics (IMOMM) scheme was introduced by Maseras and Morokuma (1995). 
As usual, I will give the abstract. 

A new computational scheme integrating ab initio and molecular mechanics descriptions in 
different parts of the same molecule is presented. In contrast with previous approaches, this 
method is especially designed to allow the introduction of molecular mechanics corrections 
in full geometry optimizations concerning problems usually studied through ab initio calcu¬ 
lations on model systems. The scheme proposed in this article intends to solve some of the 
systematic error associated with modelling through the use of molecular mechanics calcula¬ 
tions. This method, which does not require any new parameter, evaluates explicitly the energy 
derivatives with respect to geometrical parameters and therefore has a straightforward applica¬ 
tion to geometry optimization. Examples of its performance on two simple cases are provided: 
the equilibrium geometry of cyclopropane and the energy barriers on S N 2 reactions of alkyl 
chloride systems. Results are in satisfactory agreement with those of full ab initio calculations 
in both cases. 

Twenty years is a long time in molecular modelling, and it is interesting to note both the 
change of emphasis for hybrid models and the increased level of sophistication. Geometry 
optimization is de rigueur, and transition states are fair game for scrutiny. 

To fix our ideas, I will discuss the (hypothetical) ‘real’ system M(P(CH 3 ) 3 ) 2 used by the 
authors quoted above. Here M is a transition metal such as platinum. Our ‘model’ system 
for the ab initio study is M(PH 3 ) 2 where we have replaced the methyl groups by hydrogen. 
In Figure 24.2 we divide the atoms in both the real (MM, lower molecule) and model (ab 
initio, upper molecule) systems into four different sets, each of them with corresponding 
coordinates. 

• Set 1 atoms are present in both. 

• Set 2 atoms are present in the ab initio system but not in the MM. 

• Set 3 atoms are present in the MM calculation, but each atom is replaced by a simpler 
one in the ab initio calculation (these are of course the ‘link atoms’ discussed earlier). 

• Set 4 atoms are present only in the MM calculation. 
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Figure 24.2 The (upper) model system is for ab initio study; the (lower) real system for MM 
study 


The coordinates of atoms in set 3 are a function of those of atoms in sets 1 and 2, and the 
authors show how to write equations for the energies and gradients for the combined ab 
initio and MM descriptions. Care has to be taken to eliminate double counting of energy 
terms. The algorithm is as follows. 

• Input coordinates for the set 1, set 2 and set 4 atoms. 

• Use the ab initio procedure to calculate the energy and the gradients with respect to the 
coordinates of set 1, and of set 2. 

• Carry out a MM optimization for the set 4 atoms keeping set 1 and set 3 frozen. 

• Add the energies and gradients and test for self consistency. 


24.3 IMOMO 

The next step, integrated molecular orbital + MO, came in 1996 and once again I will let 
the originators (Humbel et al. 1996) spell out the good news. 

A new theoretical method, called IMOMO (integrated MO (molecular orbital) + MO), for 
integration of two different levels of MO approximation is presented. Only the active or more 
difficult part of a molecule is treated at a higher level of approximation and the rest of the 
molecule at a lower level of approximation. The integrated total energy and energy derivatives 
are defined from three different calculations, and the structure of transition state as well as the 
equilibrium structure can be optimized using the integrated energy. Any combination of any 
molecular orbital approximations (ab initio, density functional to semi-empirical) can be used. 
Test calculations in the IMOMO method have been performed and compared with normal MO 
calculations for the conformation energy of ethane and n-butane and the S N 2 reaction of ethyl, 
propyl, isobutyl and neopentyl chloride with Cl'. The results indicate that these methods have 
a tremendous potential for theoretical study of larger molecules, in particular for transition 
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24.4 ONIOM (Our Own AMayered Integrated Molecular Orbital 
and Molecular Mechanics) 

We note several different schemes in which a molecule is divided into two parts, with one 
accurately described and the other treated at a lower level of theory. The total energy of the 
real system is a combination of both parts. For molecular systems normally treated with 
MM, the integrated IMOMM method is appropriate. For systems and reactions that require 
a more accurate description of the geometries and energies, the integrated IMOMO method 
has been shown to reproduce high-level ab initio results. The next logical step involves 
combining these approaches into an onion-like multilayered scheme, with MM used to 
describe steric and electrostatic effects in the exterior parts of a molecule, an intermediate 
MO method describing the electronic effects of functional groups or ligands close to the 
centre of action, and a highly accurate method to deal with the electron correlation on the 
most important parts of the molecule. The layers are conventionally known as the low, 
medium and high layers; these are treated with different model chemistries and the results 
automatically combined to give the final predicted results, and such a calculation would 
be denoted ONIOM3 (Svensson et al. 1996). Rather than quote the original abstract, I will 
give you a more up-to-date version due to Vreven and Morokuma (2000) that contains a 
wealth of useful detail. 

Five years ago Morokuma and colleagues introduced the IMOMO method, which integrates 
two molecular orbital methods into one calculation. Since then, the method has been expanded 
in several ways; it has been generalized to consider up to three methods, and has been unified 
as the ONIOM method to include both MO and MM combinations. In this review we present 
the history of the method, a number of chemical problems that we have studied, how to assess 
IMOMO combinations and partitionings, and our latest efforts that take the method beyond 
the conventional investigation of ground state energy surfaces. In particular we emphasize 
the importance of the S-value test for validation of the ONIOM method/model combinations. 
The method combination depends much on the properties and accuracies required. Generally 
speaking, however, if the target level is CCSD(T) or G2, the best choice of low level is MP2. 

If MP2 or DFT is the target level, HF or eventually semiempirical MO methods are good 
choices of low level. These methods can be further combined with an outer-most layer of the 
MM level. 
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Appendix A: A Mathematical 
Aide-Memoire 


There are several mathematical topics such as vectors, vector calculus, determinants, 
matrices and linear operators that appear throughout this book. You should have them 
at your fingertips if you want to fully understand the subject; it is no use speaking glibly 
about the scalar and cross products of two vectors, gradients and Hessians, eigenvalues and 
eigenvectors, Hamiltonian operators and so on when you have not the slightest idea what 
the terms mean. I assume that you have come across most of these topics elsewhere, and 
so this appendix is essentially an aide-memoire. 

Angular momentum dominates the theory of atomic structure (and if you know about 
commuting linear operators, you will understand why), so I have included a section on this 
topic. 


A.l Scalars and Vectors 

I should emphasize that physical quantities such as mass, temperature and time have two 
component parts: a measure (i.e. how many) and a unit (e.g. a kelvin or a second). It is 
usual these days to express all physical quantities in the system of units referred to as the 
Systeme International (SI). The International Union of Pure and Applied Physics and the 
International Union of Pure and Applied Chemistry both recommend SI units. These are 
based on the metre, kilogram, second and ampere as the fundamental units of length, mass, 
time and electric current. There are three additional units in SI: the kelvin, mole and candela 
that are units of thermodynamic temperature, amount of substance and luminous intensity, 
respectively. 

A scalar sis a quantity such as mass, temperature, time that can be represented by a single 
value. The modulus (or magnitude, or size) of s, denoted ,sj, is the value of s irrespective 
of its sign. So if T is a temperature and T = —273 K, the modulus of T is 273 K. If you 
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know about thermodynamics, you will realize that temperatures cannot be lower than 0 K, 
but that is another story. 

A vector quantity is a quantity such as displacement, force, velocity that has both a 
magnitude and a direction in space. I am going to write vectors in bold script such as a, b, 
E. Other authors follow different conventions, and you might find it easier to write vectors 
by underlining them a, b, E rather than trying to write them in bold print with a biro. My 
continental European colleagues often write vectors with an arrow above the symbol a, 
b, E. 

The size or modulus of vector v is denoted by v or |v|. It is important to distinguish 
vector quantities from scalar ones; they are certainly not the same thing. Develop the habit 
of writing vectors and their moduli in a distinctive style. Once again, you should recognize 
that vectors representing physical quantities have a measure and an SI unit. A unit vector is 
one with unit modulus (apart from the SI unit), and a unit vector in the direction of vector 
vis v/v. Some authors write unit vectors with a special ‘hat’, typically v. I am not going to 
follow this convention, because I want to reserve the hat symbol for a linear operator , of 
which more shortly. 


A.2 Vector Algebra 

Vector notation makes many equations look simpler than they would otherwise be, and 
so I will make extensive use of vectors through the book. I need to remind you of certain 
elementary properties. 

A.2.1 Vector Addition and Scalar Multiplication 

It is common practice when drawing vectors to show their direction by including an arrow¬ 
head, as illustrated in Figure A.l. The vector 2v is a vector in the same direction as v but 
with twice the modulus. The vector —2v has twice the modulus of v but it points in the 
opposite direction. For many kinds of vector, it does not matter where on the page we draw 
them. Two vectors are equal if they have the same direction and magnitude. 



Figure A. 1 Scalar multiplication of a vector 

Two vectors u and v are added together according to the parallelogram rule', we draw u 
and then add v onto u according to Figure A.2. 

Vectors obey the law that u + v = v + u and we could have equally drawn v first and 
then added u to get the sum of the two. 
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Figure A.2 Addition of two vectors 


A.2.2 Cartesian Coordinates 

The point P in Figure A.3 has Cartesian coordinates ( x , y, z). The Cartesian unit vectors 
e x , e v and e z are directed parallel to the x, y and z axes respectively. They are sometimes 
written i, j and k rather than e „ e y and e z . The position vector r of point P can be expressed 
by drawing an arrow from the origin to P, and in view of the laws for vector addition and 
scalar multiplication we have 


r = xe x + ye y + ze z 


(A.l) 


I have indicated the components x, y, z in Figure A.3. 



Figure A.3 Cartesian components of a vector 


A.2.3 Cartesian Components of a Vector 

Vectors u and v can be specified by their Cartesian components 


u = u x e x + u y e y + w z e z 



so that the vector ku (where k is a scalar) has components 


ku = ku x e x + ku y e y + ku z e z 
whilst the vector sum u + v has components 


(A.2) 


u + v = (u x + v x ) e x + {uy + v y ) e y + (u z + v z ) e z 


(A.3) 






382 Molecular Modelling for Beginners 


A.2.4 Vector Products 

Vectors can be multiplied by scalars, but some care is needed when considering products 
of two vectors. 

A.2.4.1 Scalar (or Dot) Product 

The scalar (or dot) product of two vectors u and v is 

u.v = |u| |v| COS0 (A.4) 

where 0 is the angle between u and v, and |u|, |v| are the moduli of the vectors. If u.v = 0 
and neither of u and v are zero vectors, then we say that u and v are orthogonal (in other 
words, they are at right angles to each other). 

Scalar products obey the rules 


u.v = v.u 

u. (v + w) = u.v + u.w 
and the Cartesian unit vectors satisfy 

e^.e^, = e*.e z = e,,.e z = 0 
It follows that the scalar product of u and v can be written 

u.v = u x v x + u y v y + u z v z (A.5) 


and the modulus of vector v is 

|v| = (v.v) 1/2 = (v 2 x + v 2 y + vl) l/2 

It also follows that the angle 6 between the vectors u and v is given by 


cos 0 = 

The scalar product of u and v gives the projection of u in the direction of v (Figure A.4), 
which by symmetry is the same as the projection of v in the direction of u. 



Figure A.4 Projection of u in the direction of i 
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A.2.4.2 Vector (or Cross ) Product 

In addition to the scalar product u.v, there is a vector (or cross) product u x v 

u x v = |u| |v| sin0n (A.6) 

where 9 is the angle between u and v and n is a unit vector normal to the plane containing 
u and v. The direction of u x v is given by the direction a screw would advance if rotated 
from the direction of u to the direction of v. This is shown in Figure A.5. 



Figure A.5 Vector product of u and v 
Vector products obey the rules 

U X V = —v X u 
ux(v + w) = uxv + uxw 
and the Cartesian unit vectors satisfy 

e, x e, = e, x e, = e z x e z = 0 
e,^ = e z 
e y e z = e x 


It follows that the vector product of u and v can be written 

u X V = (uyV z - u z v y ) e x + (u z v x - u x v z ) e,, + ( u x v y - UyV x ) e z (A.7) 

A.3 Scalar and Vector Fields 

Mathematically, a field is a function that describes a physical property at points in space. 
In a scalar field, this physical property is completely described by a single value for each 
point (e.g. temperature, electron density). A scalar field can be represented pictorially by 
contours, which are fines or surfaces that link together points with the same value of the field 
in a given plane. An example is shown in Figure A.6 (taken from a quantum mechanical 
study of the electron distribution in a diatomic molecule; the contour fines join points of 
equal electron density in a plane drawn so as to contain the two nuclei, which on close 
inspection can be identified as the circles joined by a ‘bond’). 

For quantitative work we need to know the values of the field at each contour; this is 
sometimes achieved by labelling the contours or by a system of colour coding. For vector 
fields (e.g. gravitational force, electrostatic field intensity), both a direction and a magnitude 
are required at for each point. 
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Figure A. 6 Contour lines for a scalar field 


A.4 Vector Calculus 

In this section, I want to explain how fields can be differentiated. I will then discuss two 
aspects of scalar fields; their gradient (which describes their rate of change in space) and 
their integrals over surfaces and volumes. There are other advanced concepts that you might 
have come across, for example the divergence and curl of a vector field. I do not intend to 
draw on these ideas in this text. 


A.4.1 Differentiation of Fields 

Suppose that the vector field u(t) is a continuous function of the scalar variable t. As t 
varies, so does u and if u denotes the position vector of a point P, then P moves along a 
continuous curve in space. 

For most of this book we will identify the variable t as time and we might be interested 
in studying the trajectory of a particle along the above curve in space. Suppose then that a 
particle moves from point P to point Q as in Figure A.7, in time 8 1. 

Mathematicians tell us that we will profit from a study of what happens to the ratio 
8u/8t as 8f gets smaller and smaller, just as we do in ordinary differential calculus. The first 
differential du/dr is defined as the limit of this ratio as the interval 8 1 becomes progressively 
smaller: 




(A.8) 
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Figure A.7 A curve in space 


It is a vector directed along the tangent at P (left to right in the figure). The derivative 
of a vector is the vector sum of the derivatives of its components. The usual rules for 
differentiation apply: 


du dv 
d t d t 


d 

dt 


(ku) 


d 

Jt 


(/u) 


fu 

dt 


+f 


du 

dt 


where k is a scalar and/ a scalar field. 


A.4.2 The Gradient 

One of the most important properties of a scalar field is the rate at which it changes in 
space. A vector called the gradient vector describes this. Suppose that / is a scalar field, 
and we wish to investigate how/ changes between the points r and r + dr. Here 

dr = e^dx + e^dy + e^dz 


and the change in/ is 


d/=/(r + dr)-/(r) 


We know from the rules of partial differentiation that 



and so we identify d f as a certain scalar product 


d/ = 


( 


h 


3f df 

—e, + — 
dy* dz 


. (e x dx + e y dy + e,dz) 
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The first vector on the right-hand side is called the gradient off, and it is written grad/ (in 
this text): 


8 f df df 

grad / = —e x + — e, + — e, 
dx dy dz 


(A.9) 


An alternative notation, used in other texts, involves the use of the so-called gradient 
operator V (pronounced del) 


V - 


9 9 9 

—e, + —e, + —e z 
dx dy dz 


In this case the gradient off is written V/. 

The vector grad/ is a vector field whose direction at any point is the direction in which/ 
is increasing most rapidly and whose magnitude is the rate of change off in that direction. 
The spatial rate of change of the scalar field/ in the direction of an arbitrary unit vector e 
is given by the scalar product e.grad/. 


A.4.3 Volume Integrals of Scalar Fields 

In order to evaluate quantities such as the mass or electric charge contained within a region 
of space, it is necessary to evaluate certain volume integrals. 

I can best illustrate the concept with the simple example shown in Figure A. 8. Suppose 
that the electric charge density p inside a cubic box whose faces are the planes x = 0, 
x = 1; y = 0,y = 1; z = 0, z — 1 is given by p = p 0 (x+y + z), where p 0 is a constant. 


Figure A.8 A cubic volume 


We divide each axis into differential elements, cbc’s along the x axis, dy’s along the y axis 
and dz’s along the z axis, giving a number of infinitesimal differential volume elements 
each of volume dxdydz. The charge enclosed by each differential volume element cbcdydz 
is pdxdydz and so the total Q enclosed by the box is 


’-/// 


p 0 (x + y + z) dxdydz 
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We form three single integrals. First of all, we draw within the region a column having 
cross-section d z, and constant x and y. 

In order to add these contributions to Q, we integrate between the limits of z = 0 and 
z = 1 to give 



Next we form the y integral by drawing a slice parallel to the yz plane and including the 
column. The limits of the y integration are y = 0 and y = 1: 



Finally we complete the volume by adding together all the slabs, and the limits of the x 
integration are x = 0 to x = 1: 



I have used the symbol dr to denote a volume element; other commonly used symbols 
are dV, dr, dr. 


A.4.4 Line Integrals 

Figure A.9 refers to a particle at point P moving along a curve in space under the influence 
of a force F. I have taken the force to be centred at the origin, for example the particle 
at P might be a point charge moving under the influence of another point charge at the 
origin. The work done w in moving the short distance 8r is the projection of F along the 
displacement F.8r. In order to calculate the total work done as the particle moves from 
initial point A to final point B, we divide the curve into small segments 8r 1; 8r 2 , . . . 8r ;V . 
We then have 


W *J>(r/) .8r, 


We let the number of points become infinitely large, and the summation approaches a limit 
called the line integral. 



(A. 10) 


If the points A and B coincide, then the line integral refers to a closed curve and we often 
write the line integral with a circle round the integral sign 
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Figure A.9 Work done under the influence of a force 

If the line integral of a certain vector field F is zero around any arbitrary closed path, 
then the vector field is called a conservative field. It can be shown that every conservative 
field can be written as the gradient of a suitable scalar field 

F = —grad0 (A. 11) 

The importance of scalar fields such as <p is that changes in </> depend only on the starting 
point and finishing point, and not on the path chosen to get between these points. 


A.5 Determinants 

The set of simultaneous linear equations 

£*11*1 + a l 2 x 2 = b x 
a 2 \Xx + a 22 x 2 = b 2 

has solution 

b\a 22 — b 2 a l2 b 2 a n — b 2 a 2l 

Xx = - x 2 = - 

£*11022 — £*12021 «11«22 — £*12<*21 

If we define a determinant of order 2 by the symbol 

= (ad — be) (A. 12) 

then these solutions may be written as 

I bx a 12 I I On bx 

| b 2 a 22 | _ | o 2 i b b 

011 £*12 I I £*n £*12 

£*21 £*22 I I £*21 £*22 


I a b I 
| c d | 


Xx = 
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We speak of the elements, rows, columns and diagonal of the determinant in an obvious 
manner. In a similar fashion, solution of the three simultaneous linear equations 

a n Xi + a 12 x 2 + a n x 3 = b x 
021*1 + 022*2 + 023*3 = &2 
031*1 + 032*2 + 033*3 = h 


involves a denominator 


011022033 — 011023032 + 021032013 — 021012033 + 031012023 — 031022013 


and so we define a determinant of order three by 


011 012 013 

021 022 023 

031 032 033 


= 011022033 — 011023032 + 021032013 — 021012033 + 031012023 — 031022013 

(A. 13) 


By inspection, this can be written in terms of certain determinants of order 2 


011 012 013 

021 022 023 

031 032 033 


= All 


022 023 

032 033 


021 


012 013 . 012 013 

+ 031 

032 033 | | 022 023 


The three determinants of order 2 are called the minors of a n , a 21 and a 3l respectively; 
they are the determinants produced by striking out the first column and successive rows of 
the determinant of order 3. A little analysis shows that this determinant may be expanded 
down any of its columns or along any of its rows by suitably combining products of elements 
and their minors. 

The properties of determinants are as follows. 

1. The value of a determinant is unchanged by interchanging the elements of any corres¬ 
ponding rows and columns. 

2. The sign of a determinant is reversed by interchanging any two of its rows or columns. 

3. The value of a determinant is zero if any two of its rows (or columns) are identical. 


A.6 Matrices 

A matrix is a set of m x n quantities arranged in a rectangular array of m rows and n columns, 
for example 

^ 011 012 • • • «ln \ 

021 022 • • • 02n I 

\ a m i a m2 ... a mn ) 

Throughout this text, I will denote matrices by bold letters, just like vectors. The matrix 
above is said to be of order m by n (denoted mxri). The null matrix O is one whose elements 
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are all zero. If m = n, the matrix is said to be square of order n, and a square matrix whose 
only nonzero elements are the diagonal elements is said to be a diagonal matrix. Thus 


(an o 0 \ 
0 a 22 0 

\ 0 0 a 33 J 


is a diagonal matrix. The unit matrix (1 or I) of order n is a diagonal matrix whose elements 
are all equal to 1. 

A row vector is a matrix of order lxn and a column vector is a matrix of order n x 1. 
For example (a l , a 2 ,..., a n ) is a row vector and 



is a column vector. 

A.6.1 Transpose of a Matrix 

Interchanging rows and columns of a determinant leaves its value unchanged, but inter¬ 
changing the rows and columns of a matrix A produces a new matrix called the transpose 
A T . Thus for example if 



In the case that A = A T then we say that the matrix A is symmetric. A symmetric matrix is 
square and has ay = %. 

A.6.2 Trace of a Square Matrix 


Consider the matrix 



The elements a n , a 22 and a 33 are called the diagonal elements. The sum of the diagonal 
elements is called the trace of the matrix: 


TrA = a n + a 22 + a 33 


(A. 14) 


and the following rules are obeyed: 


Tr (kA) = kTrA 
Tr (AB) = Tr (BA) 


(A. 15) 


Tr (A + B) = TrA + TrB 
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A.6.3 Algebra of Matrices 

We need to be aware of a few simple matrix properties as follows. 

1. If A and B are two matrices of the same order with elements a y and by, then their sum 
S = A + B is defined as the matrix whose elements Cy = ay + by. 

2. Two matrices A and B with elements ay and by are equal only if they are of the same 
order, and all their corresponding elements are equal ay = by. 

3. The result of multiplying a matrix A whose elements are ay by a scalar k is a matrix 
whose elements are ka ir 

4. The definition of matrix multiplication is such that two matrices A and B can only be 
multiplied together to form their product AB when the number of columns of A is equal 
to the number of rows of B. Suppose A is a matrix of order (m x p), and B is a matrix 
of order (p x n). Their product C = AB is a matrix of order (m x n) with elements 

C y = J2 a ik b kj (A. 16) 

Thus for example if 



so we see from this simple example that AB is not necessarily equal to BA. If AB = BA 
then we say that the matrices commute. 


A.6.4 Inverse Matrix 

Let A be the square matrix 


a ln \ 
a 2n 


a nn 


If a suitable matrix X can be found such that AX = 1 then we refer to X as the inverse of A, 
and write it A -1 . We say that A is invertible or nonsingular. Not all matrices are invertible; 
the rule is that square matrices with nonzero determinant such as 
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are invertible, whilst those with zero determinant such as 



are not. Matrices with zero determinants are often called singular. 


A.6.5 Matrix Eigenvalues and Eigenvectors 


Consider once again an ( n x n) matrix A. If we form the product of A with a suitable but 
arbitrary column vector u then sometimes it will happen that the product Au is a linear 
multiple k of u, Au = au. In this case we say that u is an eigenvector (or eigenfunction) of 
A with eigenvalue k. There are exactly n eigenvalues and eigenvectors. Thus for example if 


then we find 


0 1 
= 1 0 
\0 1 


0 

1 

0 


) 


A., = V2 and u[ = ( I ^5. I ) 

V 2 2 2 J 

k 2 = 0 and Uj = ^ — ^ 0 ^ ^ 

k 3 = -V2 and = ( I I ) 

V 2 2 2 / 


If two or more eigenvalues are the same, we say that that they are degenerate. 


A.7 Angular Momentum 


Figure A.7 shows a particle moving along a curve in space. If the particle has mass m then 
its linear momentum p is defined as 


P = 


dr 

m d t 


(A. 17) 


Newton’s second law of motion relates force to the rate of change of linear momentum 


F 


dp 

d t 


(A. 18) 


In the absence of an applied force, the linear momentum is constant. When the mass is 
constant. Equation (A. 18) can be written F = md 2 r/df 2 . 

The angular momentum 1 is defined as a vector cross product 


1 = r x p 


(A. 19) 
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Care has to be exercised when dealing with angular momentum, because 1 depends on the 
point chosen to be the coordinate origin. If we were to choose a new coordinate origin such 
that the position vector of the particle were 

r =r + R 


where R is a constant vector, we have 


I' = r' x p 

= 1 + R x p 


On differentiation of both sides of Equation (A. 19) with respect to time we find 


dl 

df 

dl 

dr 

dl 

dr 


dp 

dr 


rxF 


The vector cross product of r and the applied force F, rxF, is known as the torque and so 
I have proved that in the absence of an applied torque, the angular momentum of a particle 
remains constant in time. 

Consider now a particle of mass m executing circular motion around the z axis as shown 
in Figure A. 10. We can write 


r = xe x + ye y + ze z 
= a sin 0e x + a cos Oty + de z 



Figure A. 10 Circular motion around the z axis 
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Differentiating we have 


It also follows that 


dr d6 d 0 

— = —a — cos 0e x + a — sm 0e v 
d t d t d t 

dO . , 

= a t (- y e* + x *y) 

dO 

~ di* z x r 

I = rxp 


ao , , , ,, x 

= m— {-zxe x + zye y + (x + y ) e z ) 

The vector 1 is not directed along the z axis because of the choice of ‘origin’ along the z 
axis but for rotation with z = 0we have 

x de . 2 

l=m ^A x +>’)'. 


The quantity ma 2 turns out to have a special significance for circular motion, and we call 
it the moment of inertia about the z axis. 


A.8 Linear Operators 

I now need to remind you about linear operators, mathematical entities that form a key 
concept in the language of quantum mechanics. An operator A can be thought of as a 
mathematical entity that turns one function into another. It is conventional to write them 
with the hat sign in order to distinguish them from other mathematical constructs. To keep 
the discussion simple, I will just deal in this section with operators that work on functions 
of a real single variable that I will write consistently as x. 

Thus for example we might have 

Af{x) = (x 2 + !)/(*) (A.21) 

which means that the action of A is to replace/(x) by (x 2 + 1 )f(x). So if/'(x) = exp(x) we 
wouldhave/(0.1) = 1.105 andA/(0.1) = (0.1 2 +1) x 1.105 = 1.116.1 should emphasize 
that this is not an equation that must somehow be solved for a value of x, it is a mapping 
between two sets of functions often represented as a Venn diagram (Figure A. 11). 
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Figure A.11 Venn diagram 


Now consider the operator B which operates on a differentiable function/(x) to give the 
first derivative d/(x)/dx. We can write this as 

= (A.22) 

dx 

so B acting on exp(x) gives exp(x). It is clear that action of these two operators on the same 
function generally gives a different result. 

An operator is linear if it satisfied the following two properties: 

A (f (x) + g (x)) = Af (x) + Ag (x) 

A (cf (x)) = cAf (x) where c is a scalar 


and both of the two operators discussed above are linear. 

Consider now the product of two operators A and B which we write as AS. We define 
this product as 

(A5)/(x)=a(5/(x)) 

that is to say, we calculate the result Bf (x) first and then operate with A on the result. Using 
the two operators discussed so far we have 


ABf(x) = (x 2 + 1) 


df(x) 

dx 


whilst 


BAf (x) = 2 xf (x) + (x 2 + 1) 


They certainly do not give the same result and it therefore matters which way round we 
apply the operators. If we now define a third operator C whose effect on a function is to 
multiply by x such that 


Cf (x) = xf (x) 


we see that 

BAf (x) = 2Cf (x) + ABf (x) 


This is true for all differentiable functions /(x) and so it forms an operator identity. We 
would write it as 

BA = 2C + AB 


(A.23) 
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In the special case where two operators satisfy the equality 

BA = AB (A.24) 

we say that they commute (you will recognize much of the terminology that we used for 
matrices; once again, that is a different story). Commuting operators play a major role in 
quantum mechanics, as we will shortly see. 

To every linear operator there belongs a special set of functions ufx) and scalars a, such 
that 

Aufx ) = a,M, (x) (A.25) 

The scalars can be real or complex. We say that m,(x) are the eigenvectors (or eigenfunctions ) 
of the operator and a, are the eigenvalues. Depending on the physical circumstances and the 
operator, there may be a finite set of eigenvalues, a countably infinite set or a continuum. 

To finish this section, let me give you an important result from the theory of linear 
operators (together with a simplified proof). 

Theorem 

If two linear operators commute, then it is possible to find a set of simultaneous 
eigenfunctions. 

Proof (simplified version) 

Suppose that two operators have each a single eigenvector and eigenvalue, given by 
Au ( x) = au (x) 

Bv (x) = bv (x) 

If we operate on the first equation by the second operator we find 
BAu (x) = aBu (x) 

but we know that the operators commute and so 

ABu (x) = aBu (x) 


A \hu (x)] = a [Bu (x)] 

The function in square brackets is therefore a linear multiple of the eigenfunction of the 
operator A 

Bu (x) = cu (x) 

which shows that the eigenfunctions of the two operators are the same, apart from some 
arbitrary multiplicative constant. 

Linear operators play an important role in quantum mechanics because: 

• every observable such as position, momentum, energy and angular momentum can be 
represented by a linear operator; 

• the results of measurements of these observables are given by the eigenvalues of the 
operators; 

• observables can be measured simultaneously to arbitrary precision only when their 
operators commute. 
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We usually give some concrete form to the linear operators, especially those repres¬ 
enting position and linear momentum. In the Schrodinger representation we substitute as 
follows. 

• For a position operator such as x we write x. 

• For a momentum operator such as p x we substitute 

_.h 3 
^ 2n dx 

A little operator analysis shows that 

xp x -p x x=j^~ (A.26) 

2n 

and this is consistent with the famous Heisenberg uncertainty principle. 


A.9 Angular Momentum Operators 


The classical angular momentum vector of a particle with position vector r and linear 
momentum p is 

Urxp (A.27) 

In classical mechanics, we can measure the three components and the square of the length, 
l x , ly, 4 and Z 2 = (4 2 + l y 2 + Z z 2 ), simultaneously with arbitrary precision. Things are far 
more interesting for quantum mechanical systems. 

Expanding the vector product Equation (A.27) we see that the x component of 1 is 

4 = yPz ~ zp y 

The corresponding operator is 

4 = Wz - ipy 


which becomes, in Schrodinger representation 



After some operator algebra it can be shown that the various components of the angular 
momentum operator do not commute with each other and so cannot be measured simultan¬ 
eously to arbitrary precision. They are subject to Heisenberg’s uncertainty principle. We 
find in addition that the square of the magnitude and any of the three components can be 
measured simultaneously to arbitrary precision. We conclude that at best we can simultan¬ 
eously measure the square of the magnitude and one component, which is by convention 
called the z component. 

The eigenvalues and eigenfunctions of angular momentum can be found most easily in 
spherical polar coordinates 

z =_AA 

z ^ 2n d<f) 

„ h 2 ( d 2 3 1 3 2 \ 


(A.28) 
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This problem is tackled in detail in all the traditional quantum chemistry texts, such as 
Eyring et al. (1944), and we can write 


{e,<)>) = l(l+l)^-Y lJni (0,0) 
h 

lY,, mi (6,4>)=m l ^Y, mi (0,0) 


(A.29) 


The eigenfunctions are well known from mathematical physics, and they are called 
spherical harmonics. The quantum numbers l and m t are restricted to certain discrete 
values, l = 0, 1, 2, 3,... and for each value of /, m, can take integral values ranging from 
-l to +/. 

We visualize angular momentum as follows. The vector precesses round the z axis in 
such a way that its z component is constant in time, but that the x and y components vary. 
There are 21 + 1 possible orientations of the vector. I have illustrated this for the case Z = 1 
in Figure A. 12, where just one of the three possible vector alignments is shown. 





Figure A. 12 Angular momentum with I = 1 


For each orientation the vector precesses around the axis keeping the z component fixed at 



The length of the vector is 



A.10 Problem Set 

A set of problems, together with solutions, is available on the associated website. 

Reference 

Eyring, H., Walter, J. and Kimball, GE. (1944) Quantum Chemistry, John Wiley & Sons, Inc., 
New York. 
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AMI 

Amber 

B3LYP 

Austin model 1 

Assisted model building and energy refinement 

Becke 3-parameter/Lee, Yang and Parr exchange-correlation 
functional 

CASSCF 

CID 

CIS(D) 

Complete active space self-consistent field 

Configuration interaction with double excitations 

Configuration interaction with single excitations (doubles added 
as perturbation) 

CNDO 

CNDO(S) 

EHT 

HF-LCAO 

G n 

GTO 

GVB 

HOMO 

IMOMM 

IMOMO 

INDO 

IRC 

KS-LCAO 

L-J 

LUMO 

MINDO 

MM 

MNDO 

ONIOM 

Complete neglect of differential overlap 

Complete neglect of differential overlap (spectroscopic version) 

Extended Hiickel theory 

Hartree-Fock linear combination of atomic orbitals 

Gaussian n (for accurate thermodynamic calculations) 

Gaussian-type orbital 

Generalized valence bond 

Highest occupied molecular orbital 

Integrated molecular orbital + molecular mechanics 

Integrated molecular orbital + molecular orbital 

Intermediate neglect of differential overlap 

Intrinsic reaction coordinate 

Kohn-Sham linear combination of atomic orbitals 

Lennard-Jones potential 

Lowest unoccupied molecular orbital 

Modified intermediate neglect of differential overlap 

Molecular mechanics 

Modified neglect of differential overlap 

Our own /V-layered integrated molecular orbital and molecular 
mechanics 

OPLS 

PM3 

Optimized potentials for liquid simulations 

Parameterized model 3 
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PPP 

QCISD 

QSAR 

SAMI 

SCF 

SCRF 

STO 

UFF 

ZDO 

ZINDO/1 

ZINDO(S) 


Pariser-Parr-Pople 

Quadratic configuration interaction with singles and doubles 

Quantitative structure and activity relationships 

Semi at initio model 1 

Self-consistent field 

Self-consistent reaction field 

Slater-type orbital 

Unified force field 

Zero differential overlap 

Zemer’s intermediate neglect of differential overlap 1 
ZINDO for spectroscopy 
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< ... > Mean value/time average 

A Thermodynamic Helmholtz energy 

A, B,... Generic operators 

a 0 Atomic unit of length (the bohr) 

B Wilson B-matrix 

C 6 , C12 Lennard-Jones parameters 

C v , C p Heat capacities at constant volume and pressure 

D(s) Density of states 

D 0 Spectroscopic dissociation energy 

D c Thermodynamic dissociation energy 

d Contraction coefficient (e.g. in STO-nG expansion) 

dr Volume element 

E Electron affinity 

E(r) Electric field (r = field point) 

E b Atomic unit of energy (the hartree) 

F Force 

G Thermodynamic Gibbs energy 

g Gradient vector 

g(r!, r 2 ) Two-electron operator 

H Hessian matrix 

H Thermodynamic enthalpy; classical Hamiltonian 

H Hamiltonian operator 

H v ( $) Hermite polynomial of degree v 

h] Matrix of one-electron integrals in LCAO models 

h m (r i ) One-electron operator 

I, L Ionization energy 

J, K, G Coulomb, exchange, G matrices from LCAO models 
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J( r) Coulomb operator 

j Square root of — 1 

j, J Total angular momentum vector operators 

K( r) Exchange operator 

k s Force constant 

L-J Lennard-Jones (potential) 

1, L Angular momentum vectors 

1, L Orbital angular momentum vector operators 

n Amount of substance; principal quantum number 

n* Effective quantum number 

P(r) Dielectric polarization (r = field point); electron density 

p Pressure 

p e Electric dipole moment 

Q Partition function 

Q a Point charge 

q Normal coordinate; atomic charge; molecular partition function; Mulliken 

atom charge 
q Quaternion 

q e Electric second moment 

R Gas constant 

R Rotation matrix 

R a Position vector 

R t Equilibrium bond length 

R u Rydberg constant for hydrogen 

Roo Rydberg constant for one-electron atom with infinite nuclear mass 

r, R Field point 

S Thermodynamic entropy 

s Shielding constant 

s, S Spin angular momentum vector operators 

U Mutual potential energy 

U, U th Thermodynamic internal energy 

V Volume 

Z Atomic number 

a Electric polarizability 

a GTO orbital exponent; exchange parameter in Xa DFT 

a A Hiickel n -electron Coulomb integral for atom A 

a e Vibration-rotation coupling constant 

Pab Hiickel n -electron resonance integral for bonded pairs A, B 

/f° AB Bonding parameter in semiempirical theories (e.g. CNDO) 

e Energy; orbital energy 

e 0 Permittivity of free space 

s r Relative permittivity 

t, STO orbital exponent 

© e Electric quadrupole moment 

/x Reduced mass 
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P(r) 

Pl(Xi) 

p 2 (xi,x 2 ) 

0 

m 

X 

<P(R, t) 

<p(R!,r 2 , ...; 
<A( r ) 

Electrostatic charge distribution (r = field point) 
One-electron density function 

Two-electron density function 

Total mutual potential energy 

Electrostatic potential (r = field point) 
Electronegativity; basis function in LCAO models 
Time-dependent wavefunction 

1 Many-particle wavefunction 

Orbital (i.e. single-particle wavefunction) 

Angular vibration frequency 

<W e Xe 

Anharmonicity constant 
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